multivalent conjugate vaccine against GBS based on the capsule polysaccharides of the clinically 
relevant serotypes (Paoletti et al., 1999; Baker et al., 1999; Baker et al., 2000; Paoletti and 
Kasper, 2002). However, there are a number of technical difficulties to overcome with capsule- 
containing conjugate vaccines: multiple serotypes are needed, an appropriate protein conjugate 
needs to be identified and validated, and potential cross-reaction with human tissues needs to be 
addressed (Korzeniowska-Kowal et al., 2001). The use of cell surface proteins from GBS 
represents an attractive alternative to capsule polysaccharides for the development of a vaccine 
against these bacteria. The surface proteins Sip, Rib, α and β from GBS have already been 
shown to confer protective immunity in mice against GBS infections (Madoff et al., 1992; 
Larsson et al., 1997; Larsson et al., 1999; Brodeur et al., 2000). Also, two unique surface proteins 
from a serotype V strain were shown in a mouse model to protect against GBS infection 
(Areschoug et al., 1999). Finally, antibodies against C5a peptidase from GBS were found to 
initiate macrophage killing of the bacteria (Cheng et al., 2001). 

The interaction of GBS with its host is a complex process involving the colonization and 
penetration of epithelial and endothelial surfaces and the evasion of the immune defence 
(Spellerberg, 2000). In streptococci, fibrinogen binding has been shown to play a significant role 
in the adhesion to host surfaces (Courtney et al., 1994; Cheung et al., 1991; Ni et al., 1998; Pei 
and Flock, 2001) and the protection from the immune system (Courtney et al., 1997; Thern et al., 
1998; Ringdahl et al., 2000). Therefore, several studies have addressed the molecular basis of 
fibrinogen binding in streptococci of the serological groups A, C and G (Fischetti, 1989; Meehan 
et al., 1998; Vasi et al., 2000). 

Fibrinogen is a 330 kDa glycoprotein found in high concentrations in blood plasma (Fuss et al., 
2001; Mosesson et al., 2001). It is a hexamer composed of two each of the Aα-, Bβ-, and γ-chains 
linked together by disulfide bonds. Fibrinogen is a key player in haemostasis and mediates 
platelet adherence and aggregation at sites of injury. Furthermore, it is cleaved by thrombin to 
form fibrin, which is the major component of blood clots. Fibrinogen also plays a role in 
opsonophagocytosis. It has been shown to inhibit the binding of the activated complement factor 
C3b, thereby blocking the activation of the alternative complement pathway (Whitnack et al., 
1984; Whitnack and Beachey, 1985). The newborn's unique susceptibility to disseminated GBS 
infections has been associated with a relative complement deficiency (Mills et al., 1979; 
Edwards et al., 1983). Fibrinogen binding of GBS may thus play an important role in the 
inhibition of the residual complement activity in the newborn (Noel et al., 1991). 

In several studies, the interaction of GBS with human fibrinogen has been demonstrated 
(Schonbeck et al., 1981; Lammler et al., 1983; Chhatwal et al., 1984; Spellerberg et al., 2002). 
However, the molecular basis of fibrinogen binding in GBS remained unknown. 

GBS has been demonstrated to bind to and invade epithelial and endothelial cells (Gibson et al., 
1993; La Penta et al., 1997; Winram et al., 1998). Treatment of GBS with the protease trypsin 
abolishes the adhesive and invasive properties of the bacteria (Valentin-Weigand and Chhatwal, 
1995; Winram et al., 1998), indicating a proteinaceous nature of the adhesins and invasins in 
GBS. As adhesins and invasins are located on the surface of the bacteria and are important for 
the virulence of GBS, they represent ideal targets for the development of a GBS vaccine. 

The problem underlying the present invention was to provide means for the development of 
medicaments such as vaccines against bacterial infections. More particularly, the problem was to 
provide new adhesion factors of GBS which can be used for the manufacture of said 
medicaments. 

The problem is solved in a first aspect by an isolated nucleic acid molecule, preferably encoding 
a fibrinogen-binding-polypeptide or such protein or a fragment thereof, comprising a nucleic 
acid sequence which is selected from the group comprising 

a) a nucleic acid having at least 70% identity to a nucleic acid sequence which is selected 
from the group comprising SEQ ID NO 1 to SEQ ID NO 6, 

b) a nucleic acid which is essentially complementary to the nucleic acid of a), 

c) a nucleic acid comprising at least 15 sequential bases of the nucleic acid of a) or b), 

d) a nucleic acid which anneals under stringent hybridisation conditions to the 
polynucleotide of a), b) or c), and 

e) a nucleic acid which, but for the degeneracy of the genetic code, would hybridize to the 
nucleic acid defined in a), b), c) or d). 

The problem is solved in a second aspect by an isolated nucleic acid molecule, preferably 
encoding an adhesion factor or a fragment thereof, comprising a nucleic acid sequence which is 
selected from the group comprising 

a) a nucleic acid having at least 70% identity to a nucleic acid sequence set forth in SEQ ID 
NO 7, SEQ ID NO 8, SEQ ID NO 9 or SEQ ID NO 10, 

b) a nucleic acid which is essentially complementary to the nucleic acid of a), 

c) a nucleic acid comprising at least 15 sequential bases of the nucleic acid of a) or b), 

d) a nucleic acid which anneals under stringent hybridisation conditions to the nucleic acid 
of a), b) or c), and 

e) a nucleic acid which, but for the degeneracy of the genetic code, would hybridize to the 
nucleic acid defined in a), b), c) or d). 

In an embodiment of both aspects of the present invention the identity is at least 80 %, preferably 
at least 90 %, more preferably 100 %. 
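
As an illustration of the identity thresholds mentioned above, the following Python sketch computes percent identity between two nucleic acid sequences that have already been aligned to equal length. The specification does not prescribe an alignment method or an identity formula at this point; the function names `percent_identity` and `meets_threshold`, the gap symbol `-` and the convention of dividing by the number of aligned columns are assumptions made only for this example, and the sequences shown are made up rather than taken from the Sequence Listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: columns where both sequences carry a gap ('-') are ignored;
    every other column counts towards the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # gap-only column carries no information
        columns += 1
        if a == b and a != "-":
            matches += 1
    if columns == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / columns


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the pair reaches the given identity threshold (e.g. 70, 80, 90 or 100)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Made-up 10-mers that differ at two of ten positions.
    ref = "ATGGCTAAAC"
    var = "ATGGCTTAAG"
    print(percent_identity(ref, var))
    print(meets_threshold(ref, var, 80.0))
```

In practice the two sequences would first be aligned with an established alignment tool, and the choice of denominator (alignment length, shorter sequence, or reference length) changes the reported value, so any threshold comparison should state which convention was used.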

In a further embodiment of both aspects of the present invention the nucleic acid is DNA. 

In a still further embodiment of both aspects of the present invention the nucleic acid is RNA. 

In a preferred embodiment of both aspects of the present invention the nucleic acid molecule is 
isolated from a bacterium. 

In a more preferred embodiment of both aspects of the present invention the bacterium is a 
species selected from the group comprising Streptococci, Staphylococci and Lactococci. 

In an even more preferred embodiment of both aspects of the present invention the bacterium is a 
species which is selected from the group comprising Streptococcus agalactiae, Streptococcus 
pyogenes, Streptococcus pneumoniae and Streptococcus mutans. 

In a most preferred embodiment of both aspects of the present invention the bacterium is 
Streptococcus agalactiae. 

In an embodiment of the first aspect of the present invention the nucleic acid molecule encodes a 
fibrinogen-binding-protein comprising at least one repeat of an amino acid motive comprising 16 
amino acids. 

In an embodiment of the second aspect of the present invention the nucleic acid molecule 
encodes an adhesion factor which interacts with epithelial cells. 

In a preferred embodiment of the first aspect of the present invention the encoded fibrinogen- 
binding-protein comprises 19 repeats of the amino acid motive whereby the amino acid motive is 
any one of the ones specified or disclosed herein. 

In a more preferred embodiment of the first aspect of the present invention the repeats are 
encoded by a polynucleotide selected from the group comprising SEQ ID NO 21 to SEQ ID NO 
112. 

In a third aspect the problem underlying the present invention is solved by an isolated nucleic 
acid molecule comprising a nucleic acid sequence, whereby the nucleic acid sequence is selected 
from the group comprising SEQ ID NO 21 to SEQ ID NO 112. 

In a fourth aspect the problem underlying the present invention is solved by an isolated nucleic 
acid molecule encoding for a polypeptide whereby the polypeptide comprises an amino acid 
motive, whereby the amino acid motive is G-N/S/T-V-L-A/E/M/Q-R-R-X-K/R/W-A/D/E/N/Q- 
A/F/I/L/V/Y-X-X-K/R-X-X (SEQ ID NO 222). 

In a preferred embodiment of any of the aspects 1 to 4 of the present invention the nucleic acid is 
DNA, RNA or mixtures thereof, preferably the nucleic acid molecule is isolated from a genomic 
DNA. 

In a fifth aspect the problem underlying the present invention is solved by a vector comprising a 
nucleic acid molecule according to any aspect of the present invention. 

In a preferred embodiment the vector is adapted for recombinant expression of the polypeptide 
encoded by any of the nucleic acid molecules according to any aspect of the present invention. 

In a sixth aspect the problem underlying the present invention is solved by a cell comprising the 
vector according to the present invention. 

In a preferred embodiment the cell is a host cell. 

In a seventh aspect the problem underlying the present invention is solved by a polypeptide, 
preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, comprising an amino acid 
sequence, whereby the amino acid sequence is encoded by a nucleic acid molecule according to 
any aspect of the present invention, and fragments of said polypeptide. 

In an eighth aspect the problem underlying the present invention is solved by a polypeptide, 
preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, comprising an amino acid 
sequence, whereby the amino acid sequence is selected from the group comprising SEQ ID NO 
11 to SEQ ID NO 20. 

In an embodiment of this aspect of the present invention the polypeptide, preferably a 
fibrinogen-binding-polypeptide and/or an adhesion factor, having an amino acid sequence 
according to any of SEQ ID NO 11 to 16 is a fibrinogen-binding protein. 

In a further embodiment of this aspect of the present invention the polypeptide is an adhesion 
factor which interacts with epithelial cells. In an even more preferred embodiment the epithelial 
cells are human epithelial cells. 

In a ninth aspect the problem underlying the present invention is solved by a polypeptide 
comprising an amino acid sequence, whereby the amino acid sequence is selected from the group 
comprising SEQ ID NO 113 to SEQ ID NO 205. In an embodiment the polypeptide comprises at 
least one of the amino acid sequence according to SEQ ID NO 113 to SEQ ID NO 225 in 
combination with at least one other amino acid sequence. More preferably this at least one other 
amino acid sequence is an amino acid sequence according to any of SEQ ID NO 113 to SEQ ID 
NO 205. 

In a tenth aspect the problem underlying the present invention is solved by a polypeptide 
comprising an amino acid motive, whereby the amino acid motive is 
G-N/S/T-V-L-A/E/M/Q-R-R-X-K/R/W-A/D/E/N/Q-A/F/I/L/V/Y-X-X-K/R-X-X (SEQ ID NO 222). 

In an eleventh aspect the problem underlying the present invention is solved by a process for 
producing a polypeptide according to any aspect of the present invention comprising expressing 
the nucleic acid molecule according to any aspect of the present invention. 

In a twelfth aspect the problem underlying the present invention is solved by a process for 
producing a cell which expresses a polypeptide according to any aspect of the present invention 
or a fragment thereof, comprising transforming or transfecting a suitable host cell with the vector 
according to the present invention such that the transformed or transfected cell expresses the 
polypeptide encoded by the polynucleotide contained in the vector. 

In a thirteenth aspect the problem underlying the present invention is solved by a pharmaceutical 
composition, especially a vaccine, comprising a polypeptide or a fragment thereof, as defined in 
any aspect of the present invention or a nucleic acid molecule according to any aspect of the 
present invention. 

In a preferred embodiment the pharmaceutical composition comprises an immunostimulatory 
substance, whereby the immunostimulatory substance is preferably selected from the group 
comprising polycationic polymers, immunostimulatory deoxynucleotides (ODNs), synthetic 
KLK peptides, neuroactive compounds, alum, Freund's complete or incomplete adjuvants or 
combinations thereof. 

In a preferred embodiment the immunostimulatory substance is a combination of either a 
polycationic polymer and immunostimulatory deoxynucleotides or of synthetic KLK peptides and 
immunostimulatory deoxynucleotides. 

In a more preferred embodiment the polycationic polymer is a polycationic peptide and/or 
whereby the neuroactive compound is human growth hormone. 

In a fourteenth aspect the problem underlying the present invention is solved by the use of a 
polypeptide according to any aspect of the present invention or a fragment thereof for the 
manufacture of a medicament, especially for the manufacture of a vaccine against bacterial 
infection. 

In a preferred embodiment the bacterial infection is a bacterial infection of Streptococcus 
agalactiae. 

In a fifteenth aspect the problem underlying the present invention is solved by the use of 
molecules which inhibit the binding of a polypeptide according to any aspect of the present 
invention to fibrinogen for the manufacture of a medicament to prevent and treat bacterial 
infection. Preferably, the bacterial infection is a Streptococcus agalactiae infection. 

In a further embodiment the molecules are selected from the group comprising fibrinogen 
receptor antibodies, fibrinogen receptor mimotopes and fibrinogen receptor antagonists binding 
to a polypeptide according to any aspect of the present invention. 

In a sixteenth aspect the problem underlying the present invention is solved by the use of 
molecules which inhibit the binding of a polypeptide according to any aspect of the present 
invention to epithelial cells, preferably human epithelial cells. 

In a seventeenth aspect the problem underlying the present invention is solved by an antibody, or 
at least an effective part thereof, which binds at least to a selective part of the polypeptide or a 
fragment thereof according to any aspect of the present invention. 

In an embodiment the antibody is a monoclonal antibody. 

In a further embodiment said effective part comprises Fab fragments. 

In a still further embodiment the antibody is a chimeric antibody. 

In a preferred embodiment the antibody is a humanized antibody. 

In an eighteenth aspect the problem underlying the present invention is solved by a hybridoma 
cell line, which produces the antibody according to the present invention. 

In a nineteenth aspect the problem underlying the present invention is solved by the use of the 
antibody according to the present invention for the preparation of a medicament for treating or 
preventing bacterial infections, especially Streptococcus agalactiae infections. 

In a twentieth aspect the problem underlying the present invention is solved by an antagonist 
which reduces or inhibits the activity of the polypeptide or a fragment thereof according to any 
aspect of the present invention. 

In a twenty-first aspect the problem underlying the present invention is solved by a method for 
identifying an antagonist capable of reducing or inhibiting the activity of the polypeptide or 
fragment thereof according to any aspect of the present invention comprising: 

a) contacting an isolated or immobilized polypeptide according to any of the aspects of the 
present invention or a fragment thereof with a candidate antagonist under conditions to 
permit binding of said candidate antagonist to said polypeptide or fragment thereof, in 
the presence of a component capable of providing a detectable signal in response to the 
binding of the candidate antagonist to said polypeptide or fragment thereof; and 

b) detecting the presence or absence of a signal generated in response to the binding of the 
antagonist to the polypeptide or fragment thereof, preferably the presence of a signal 
indicating a compound capable of inhibiting or reducing the activity of the polypeptide 
or fragment thereof. 

In a twenty-second aspect the problem underlying the present invention is solved by a method 
for identifying an antagonist capable of reducing or inhibiting the activity of a polypeptide or a 
fragment thereof according to any of the aspects of the present invention comprising: 

a) providing the polypeptide according to any aspect of the present invention or a 
fragment thereof, 

b) providing an interaction partner of the polypeptide according to any aspect of the 
present invention, preferably the antibody according to the present invention, 

c) providing a candidate antagonist, 

d) reacting the polypeptide, the interaction partner of the polypeptide and the candidate 
antagonist, and 

e) determining whether the candidate antagonist inhibits or reduces the activity of the 
polypeptide. 

In a twenty-third aspect the problem underlying the present invention is solved by a method for 
identifying an antagonist capable of reducing or inhibiting the interaction activity of the 
polypeptide according to the present invention or a fragment thereof to its interaction partner 
comprising: 

a) providing the polypeptide according to the present invention or a fragment 
thereof, 

b) providing an interaction partner to said polypeptide or a fragment thereof, 
preferably an antibody according to the present invention, 

c) allowing interaction of said polypeptide or fragment thereof to said interaction 
partner to form an interaction complex, 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the 
interaction complex, and 

f) determining whether the candidate antagonist inhibits or reduces the interaction 
activities of the polypeptide or the fragment thereof with the interaction partner. 

In a twenty-fourth aspect the problem underlying the present invention is solved by an antagonist 
identified or identifiable by a method according to the twenty-second or twenty-third aspect of 
the present invention. 

In a twenty-fifth aspect the problem underlying the present invention is solved by a process for 
in vitro diagnosis of a disease related to expression of the polypeptide or a fragment thereof 
according to any aspect of the present invention comprising determining the presence of a 
polynucleotide sequence encoding said polypeptide or the presence of a polypeptide according to 
any aspect of the present invention. 

In a twenty-sixth aspect the problem underlying the present invention is solved by a process for 
in vitro diagnosing a disease related to expression of the polypeptide according to the present 
invention or a fragment thereof, comprising determining the presence of a nucleic acid sequence 
encoding said polypeptide or a fragment thereof according to the present invention, or the 
presence of the polypeptide according to the present invention or a fragment thereof. 

In a twenty-seventh aspect the problem underlying the present invention is solved by a process 
for in vitro diagnosis of a bacterial infection, preferably Streptococcus agalactiae infection, 
comprising the step of determining the presence of a nucleic acid molecule according to any 
aspect of the present invention, or of a polypeptide according to any aspect of the present 
invention. 

In a preferred embodiment of the latter three aspects of the present invention the presence is 
determined in a sample which is preferably derived from a host organism. 

In a twenty-eighth aspect the problem underlying the present invention is solved by an affinity 
device comprising a support material and immobilized to said support material a polypeptide 
according to any aspect of the present invention or a nucleic acid molecule according to any 
aspect according to the present invention. 

In a twenty-ninth aspect the problem underlying the present invention is solved by the use of a 
polypeptide according to any aspect of the present invention for the isolation and/or purification 
and/or identification of an interaction partner of said polypeptide. 

In a thirtieth aspect the problem underlying the present invention is solved by the use of any of 
the polypeptides according to any aspect of the present invention for the generation of a peptide 
binding to said polypeptide. 

In a preferred embodiment the peptide is selected from the group comprising anticalins. 

In a thirty-first aspect the problem underlying the present invention is solved by the use of a 
polypeptide according to any aspect of the present invention for the manufacture of a functional 
nucleic acid, whereby the functional nucleic acid is selected from the group comprising aptamers 
and spiegelmers. 

In a thirty-second aspect the problem underlying the present invention is solved by the use of a 
nucleic acid molecule according to any aspect of the present invention for the manufacture of a 
functional ribonucleic acid, whereby the functional ribonucleic acid is selected from the group 
comprising ribozymes, antisense nucleic acids and siRNA. 

In a thirty-third aspect the problem underlying the present invention is solved by the use of a 
polypeptide according to the present invention or a fragment thereof as an antigen. 

In a thirty-fourth aspect the problem underlying the present invention is solved by the use of a 
nucleic acid according to any aspect of the present invention for the manufacture or generation of 
a functional nucleic acid, preferably a ribonucleic acid, wherein the functional ribonucleic acid is 
selected from the group comprising ribozymes, antisense nucleic acids and siRNA. 

In a thirty-fifth aspect the problem underlying the present invention is solved by the use of the 
polypeptides according to the present invention or any fragment thereof for the generation or 
manufacture of an antibody. 

As used herein the term SEQ ID NO X to SEQ ID NO Y is an abbreviation for any of the SEQ 
ID Nos comprised by X and Y including X and Y. 

The present inventors have surprisingly found that the genome of GBS comprises a variety of 
adhesion factors which share a common amino acid motive. This amino acid motive is 
responsible for the binding of the adhesion factor to fibrinogen. As used herein, an adhesion 
factor is a factor, preferably a peptide or a protein, which mediates the binding of a 
microorganism to a substrate. Preferably, the microorganism is GBS. More preferably, the 
substrate is fibrinogen and a host cell, respectively. The adhesion factor as used herein can be an 
adhesin or an invasin. The common amino acid motive can be described as follows using the one 
letter code for amino acids: 

G-N/S/T-V-L-A/E/M/Q-R-R-X-K/R/W-A/D/E/N/Q-A/F/I/L/V/Y-X-X-K/R-X-X (SEQ 
ID NO 222). 

As may be taken from the above sequence the amino acid motive comprises a total of 16 
positions. Some of the positions have to be occupied by a distinct amino acid such as, e.g., 
position 1 or 3 or 4. Other positions such as positions 15 or 16 may be occupied by any amino 
acid, preferably by a naturally occurring amino acid. These positions are marked in the above 
sequence with an 'X'. Still further positions can be occupied by different amino acids. These 
different amino acids are indicated in the above motive, whereby the various amino acids are 
separated by '/'. Accordingly, at position 2 N, S or T may be present. Any permutations of the 
above sequence of amino acids can be realized by the one skilled in the art, which are thus within 
the scope of the present invention. 
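
The position-by-position description above translates naturally into a pattern match. The following Python sketch is illustrative only: it encodes the 16 positions as a regular expression and checks whether a 16-residue peptide conforms. The names `MOTIF_POSITIONS`, `build_motif_regex`, `MOTIF_RE` and `matches_motif` are hypothetical, 'X' is assumed to mean any of the 20 standard amino acids, and the residues allowed at position 11 (read here as A/F/I/L/V/Y) are an assumption based on the other statements of SEQ ID NO 222 in this text and should be checked against the Sequence Listing. The test peptides are chosen for the example only.

```python
import re

# One entry per motif position (1..16); "X" stands for any amino acid.
# Assumption: position 11 is read as A/F/I/L/V/Y.
MOTIF_POSITIONS = [
    "G", "NST", "V", "L", "AEMQ", "R", "R", "X",
    "KRW", "ADENQ", "AFILVY", "X", "X", "KR", "X", "X",
]

ANY_RESIDUE = "ACDEFGHIKLMNPQRSTVWY"  # assumption: the 20 standard amino acids


def build_motif_regex(positions):
    """Turn the per-position residue sets into a regular expression string."""
    parts = []
    for allowed in positions:
        residues = ANY_RESIDUE if allowed == "X" else allowed
        parts.append("[" + residues + "]")
    return "".join(parts)


MOTIF_RE = re.compile(build_motif_regex(MOTIF_POSITIONS))


def matches_motif(peptide: str) -> bool:
    """True if the peptide is exactly one 16-residue copy of the motif."""
    return MOTIF_RE.fullmatch(peptide.upper()) is not None


if __name__ == "__main__":
    print(matches_motif("GNVLERRQRDAENRSQ"))  # obeys every position of the pattern
    print(matches_motif("GNVLERRQRDGENRSQ"))  # G at position 11 is not in the allowed set
```

Keeping the positions in a list rather than a hand-written pattern makes it easy to confirm that exactly 16 positions are encoded and to adjust a single position if the Sequence Listing differs from the reading assumed here.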

The present invention is thus related in one aspect to the above amino acid motive. More 
particularly, the present invention is related to any peptide or polypeptide which comprises this 
amino acid motive. It is to be understood that the terms peptide and polypeptide are used in a 
synonymous way if not indicated to the contrary. 

Polypeptides, as used herein, include all polypeptides as described below. The basic structure of 
polypeptides is well known and has been described in innumerable textbooks and other 
publications in the art. In this context, the term is used herein to refer to any peptide or protein 
comprising two or more amino acids joined to each other in a linear chain by peptide bonds. As 
used herein, unless otherwise indicated, the term refers to both short chains, which also 
commonly are referred to in the art as peptides, oligopeptides and oligomers, for example, and to 
longer chains, which generally are referred to in the art as proteins, of which there are many 
types. It will be appreciated that polypeptides often contain amino acids other than the 20 amino 
acids commonly referred to as the 20 naturally occurring amino acids, and that many amino 
acids, including the terminal amino acids, may be modified in a given polypeptide, either by 
natural processes, such as processing and other post-translational modifications, but also by 
chemical modification techniques which are well known to the art. Even the common 
modifications that occur naturally in polypeptides are too numerous to list exhaustively here, but 
they are well described in basic texts and in more detailed monographs, as well as in a 
voluminous research literature, and they are well known to those of skill in the art. Among the 
known modifications which may be present in polypeptides of the present invention are, to name an 
illustrative few, acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of 
flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphatidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, 
formation of covalent cross-links, formation of cystine, formation of pyroglutamate, formylation, 
gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, 
methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 
racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. Such modifications are well known to those of 
skill and have been described in great detail in the scientific literature. Several particularly 
common modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of 
glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in most 
basic texts, such as, for instance PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W.H. Freeman and Company, New York (1993). Many 
detailed reviews are available on this subject, such as, for example, those provided by Wold, F., 
Posttranslational Protein Modifications: Perspectives and Prospects, pgs. 1-12 in 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., 
Academic Press, New York (1983); Seifter et al., Meth. Enzymol. 182:626-646 (1990) and 
Rattan et al., Protein Synthesis: Posttranslational Modification and Aging, Ann. N.Y. Acad. Sci. 
663:48-62 (1992). It will be appreciated, as is well known and as noted above, that polypeptides 
are not always entirely linear. For instance, polypeptides may be branched or circular, generally as a result of 
posttranslational events, including natural processing events and events brought about by human 
manipulation which do not occur naturally. Circular, branched and branched circular 
polypeptides may be synthesized by non-translation natural processes and by entirely synthetic 
methods, as well. Modifications can occur anywhere in a polypeptide, including the peptide 
backbone, the amino acid side chains and the amino or carboxyl termini. In fact, blockage of the 
amino or carboxyl group in a polypeptide, or both, by a covalent modification, is common in 
naturally occurring and synthetic polypeptides and such modifications may be present in 
polypeptides of the present invention, as well. For instance, the amino terminal residue of 
polypeptides made in E. coli or other cells, prior to proteolytic processing, almost invariably will 
be N-formylmethionine. During post-translational modification of the peptide, a methionine 
residue at the NH2-terminus may be deleted. Accordingly, this invention contemplates the use of 
both the methionine-containing and the methionineless amino terminal variant of the protein of 
the invention. The modifications that occur in a polypeptide often will be a function of how it is 
made. For polypeptides made by expressing a cloned gene in a host, for instance, the nature and 
extent of the modifications in large part will be determined by the host cell posttranslational 
modification capacity and the modification signals present in the polypeptide amino acid 
sequence. For instance, as is well known, glycosylation often does not occur in bacterial hosts 
such as, for example, E. coli. Accordingly, when glycosylation is desired, a polypeptide should 
be expressed in a glycosylating host, generally a eukaryotic cell. Insect cells often carry out the 
same posttranslational glycosylations as mammalian cells and, for this reason, insect cell 
expression systems have been developed to express efficiently mammalian proteins having 
native patterns of glycosylation, inter alia. Similar considerations apply to other modifications. It 
will be appreciated that the same type of modification may be present in the same or varying 
degree at several sites in a given polypeptide. Also, a given polypeptide may contain many types 
of modifications. In general, as used herein, the term polypeptide encompasses all such 
modifications, particularly those that are present in polypeptides synthesized recombinantly by 
expressing a polynucleotide in a host cell. 

Any polypeptide comprising the amino acid motive is regarded as a polypeptide according to the 
present invention. As explained in greater detail in the examples, the present inventors have 
found that GBS comprises a number of adhesion factors which comprise not only one copy of 
the amino acid motive but a number thereof. Thus any polypeptide comprising a plurality or 
being composed of a plurality of the amino acid motive is a polypeptide according to the present 
invention. For example, the adhesion factor referred to herein as FbsA may comprise as little as 
one unit of the amino acid motive to as much as 19 copies thereof. 
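
To illustrate what "a plurality of the amino acid motive" means in practice, the following Python sketch counts non-overlapping copies of the 16-residue motive in a polypeptide sequence. It is a minimal sketch under stated assumptions: the pattern restates SEQ ID NO 222 with position 11 read as A/F/I/L/V/Y, each 'X' is taken as any single letter, the names `MOTIF`, `count_motif_copies` and `motif_spans` are hypothetical, and the toy sequence is constructed for the example rather than taken from FbsA.

```python
import re

# SEQ ID NO 222 written as a regular expression; "." stands for the X positions.
# Assumption: position 11 is read as A/F/I/L/V/Y.
MOTIF = re.compile(r"G[NST]VL[AEMQ]RR.[KRW][ADENQ][AFILVY]..[KR]..")


def count_motif_copies(sequence: str) -> int:
    """Count non-overlapping copies of the motif, scanning left to right."""
    return sum(1 for _ in MOTIF.finditer(sequence.upper()))


def motif_spans(sequence: str):
    """Return (start, end) index pairs (0-based, end exclusive) of each copy."""
    return [m.span() for m in MOTIF.finditer(sequence.upper())]


if __name__ == "__main__":
    unit = "GNVLERRQRDAENRSQ"          # one illustrative 16-residue unit
    toy = "MKK" + unit * 3 + "AAPLE"   # hypothetical protein with three tandem units
    print(count_motif_copies(toy))
    print(motif_spans(toy))
```

Applied to a real protein sequence, the number returned would correspond to the number of repeat units detected under this reading of the motive; units that deviate from the pattern at any fixed position would not be counted.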

Other adhesion factors according to the present invention are those referred to herein as PabA, 
PabB, PabC and PabD. It is to be understood that the term polypeptides according to the present 
invention also comprises any fragment, derivative or analog thereof. Further preferred 
polypeptides according to the present invention are those the amino acid sequence of which 
corresponds to SEQ ID NO 11 to 20. 

The fragment, derivative or analog of the polypeptide of the present invention may be (i) one in 
which one or more of the amino acid residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid 
residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of 
the amino acid residues includes a substituent group, or (iii) one in which the mature polypeptide 
is fused with another compound, such as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the mature polypeptide, such as a leader or secretory sequence or a sequence which is employed 
for purification of the mature polypeptide or a proprotein sequence. Such fragments, derivatives
and analogs are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

Among the particularly preferred embodiments of the invention in this regard are polypeptides 
set forth in the Sequence Listing, variants, analogs, derivatives and fragments thereof, and 
variants, analogs and derivatives of the fragments. Additionally, fusion polypeptides comprising 
such polypeptides, variants, analogs, derivatives and fragments thereof, and variants, analogs and 
derivatives of the fragments, in addition to a heterologous polypeptide, are contemplated by the 
present invention. Such fusion polypeptides and proteins, as well as polynucleotides encoding 
them, can readily be made using standard techniques, including standard recombinant techniques 
for producing and expressing a recombinant polynucleic acid encoding a fusion protein. 

Among preferred variants are those that vary from a reference by conservative amino acid 
substitutions. Such substitutions are those that substitute a given amino acid in a polypeptide by 
another amino acid of like characteristics. Typically seen as conservative substitutions are the 
replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and Ile;
interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu,
substitution between the amide residues Asn and Gln, exchange of the basic residues Lys and
Arg and replacements among the aromatic residues Phe and Tyr. 
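
By way of illustration only, the substitution groups named above can be written down as a small lookup. The following is a minimal sketch and not part of the disclosure: the group labels and the function name `is_conservative` are assumptions chosen for this example, and only the six groups listed in the text are covered.

```python
# Conservative-substitution groups as listed in the text above.
# The group labels are illustrative names, not terms defined by the source.
CONSERVATIVE_GROUPS = {
    "aliphatic": {"Ala", "Val", "Leu", "Ile"},
    "hydroxyl": {"Ser", "Thr"},
    "acidic": {"Asp", "Glu"},
    "amide": {"Asn", "Gln"},
    "basic": {"Lys", "Arg"},
    "aromatic": {"Phe", "Tyr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if the two residues differ and share one of the listed groups."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(
        original in members and replacement in members
        for members in CONSERVATIVE_GROUPS.values()
    )
```

Under these assumptions, `is_conservative("Leu", "Ile")` is true, while `is_conservative("Asp", "Lys")` is false because the residues belong to different groups. Residues not named in the text (for example Gly or Pro) are treated as having no conservative partner.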

Further particularly preferred in this regard are variants, analogs, derivatives and fragments, and 
variants, analogs and derivatives of the fragment, having the amino acid sequence of any 
polypeptide set forth in the Sequence Listing, in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 
or no amino acid residues are substituted, deleted or added, in any combination. Especially 
preferred among these are silent substitutions, additions and deletions, which do not alter the 
properties and activities of the polypeptide of the present invention. Also especially preferred in 
this regard are conservative substitutions. Most highly preferred are polypeptides having an amino
acid sequence set forth in the Sequence Listing without substitutions.

The polypeptides and polynucleotides of the present invention are preferably provided in an 
isolated form, and preferably are purified to homogeneity. Also the polypeptides according to the 
present invention are preferably isolated polypeptides. 

The polypeptides of the present invention include any polypeptide set forth in the Sequence 
Listing (in particular a mature polypeptide) as well as polypeptides which have at least 70 % 
identity to a polypeptide set forth in the Sequence Listing, preferably at least 80 % or 85 % 
identity to a polypeptide set forth in the Sequence Listing, and more preferably at least 90 % 
similarity (more preferably at least 90 % identity) to a polypeptide set forth in the Sequence 
Listing and still more preferably at least 95 %, 96 %, 97 %, 98 %, 99 %, or 99.5 % similarity 
(still more preferably at least 95 %, 96 %, 97 %, 98 %, 99 %, or 99.5 % identity) to a polypeptide
set forth in the Sequence Listing and also include portions of such polypeptides with such
portion of the polypeptide generally containing at least 5 amino acids and more preferably at
least 10, 15 or 16 or multiples thereof. Preferably, the multiples are multiples of a repeat of 16
amino acids, whereby the 16 amino acids correspond to the amino acid motive as disclosed 
herein. 

Fragments or portions of the polypeptides of the present invention may be employed for 
producing the corresponding full-length polypeptide by peptide synthesis; therefore, the 
fragments may be employed as intermediates for producing the full-length polypeptides.
Fragments or portions of the polynucleotides of the present invention may be used to synthesize
full-length polynucleotides of the present invention.

Also among preferred embodiments of this aspect of the present invention are polypeptides 
comprising fragments of the polypeptide having the amino acid sequence set forth in the
Sequence Listing, and fragments of variants and derivatives of the polypeptides set forth in the 
Sequence Listing. 

As used herein a fragment is a polypeptide having an amino acid sequence that is entirely the
same as part but not all of the amino acid sequence of the aforementioned S. agalactiae 
polypeptides and variants or derivatives thereof. 

Such fragments may be "free-standing", i.e., not part of or fused to other amino acids or
polypeptides, or they may be comprised within a larger polypeptide of which they form a part or
region. When comprised within a larger polypeptide, the presently discussed fragments most
preferably form a single continuous region. However, several fragments may be comprised
within a single larger polypeptide. For instance, certain preferred embodiments relate to a
fragment of a polypeptide of the present invention comprised within a precursor polypeptide
designed for expression in a host and having heterologous pre and pro-polypeptide regions fused
to the amino terminus of the fragment and an additional region fused to the carboxyl terminus of
the fragment. Therefore, fragments, in one aspect of the meaning intended herein, refer to the
portion or portions of a fusion polypeptide or fusion protein derived from a polypeptide of the 
present invention. 

Representative examples of polypeptide fragments of the invention include, for example, in any
selected polypeptide, fragments from about amino acid number 45 - 60, 61 - 76, 77 - 92, 93 -
108, 109 - 124, 125 - 140, 141 - 156, 157 - 172, 173 - 188, 189 - 204, 205 - 220, 221 - 236,
237 - 252, 253 - 268, 269 - 284, 285 - 300, 301 - 316, 317 - 332, 333 - 348, 410 - 414 of the
amino acid sequences disclosed herein, or any of the repeats, either alone or in combination with
one or several of the ones mentioned in the following tables 1 and 2, optionally combined with
the signal peptide or the LPXTG motif.

Table 1:

FbsA of GBS strain 6313                 FbsA of GBS strain 706 S2
1 - 35      signal peptide              1 - 35      signal peptide
45 - 60     repeat 1 (SEQ ID 113)       45 - 60     repeat 1 (SEQ ID 132)
61 - 76     repeat 2 (SEQ ID 114)       61 - 76     repeat 2 (SEQ ID 133)
77 - 92     repeat 3 (SEQ ID 115)       77 - 92     repeat 3 (SEQ ID 134)
93 - 108    repeat 4 (SEQ ID 116)       93 - 108    repeat 4 (SEQ ID 135)
109 - 124   repeat 5 (SEQ ID 117)       109 - 124   repeat 5 (SEQ ID 136)
125 - 140   repeat 6 (SEQ ID 118)       125 - 140   repeat 6 (SEQ ID 137)
141 - 156   repeat 7 (SEQ ID 119)       141 - 156   repeat 7 (SEQ ID 138)
157 - 172   repeat 8 (SEQ ID 120)       157 - 172   repeat 8 (SEQ ID 139)
173 - 188   repeat 9 (SEQ ID 121)       173 - 188   repeat 9 (SEQ ID 140)
189 - 204   repeat 10 (SEQ ID 122)      189 - 204   repeat 10 (SEQ ID 141)
205 - 220   repeat 11 (SEQ ID 123)      205 - 220   repeat 11 (SEQ ID 142)
221 - 236   repeat 12 (SEQ ID 124)      221 - 236   repeat 12 (SEQ ID 143)
237 - 252   repeat 13 (SEQ ID 125)      237 - 252   repeat 13 (SEQ ID 144)
253 - 268   repeat 14 (SEQ ID 126)      253 - 268   repeat 14 (SEQ ID 145)
269 - 284   repeat 15 (SEQ ID 127)      269 - 284   repeat 15 (SEQ ID 146)
285 - 300   repeat 16 (SEQ ID 128)      285 - 300   repeat 16 (SEQ ID 147)
301 - 316   repeat 17 (SEQ ID 129)      301 - 316   repeat 17 (SEQ ID 148)
317 - 332   repeat 18 (SEQ ID 130)      378 - 382   LPXTG motif
333 - 348   repeat 19 (SEQ ID 131)
410 - 414   LPXTG motif

Table 2:

FbsA of GBS strain 33 H1A               FbsA of GBS strain 176 H4A
1 - 35      signal peptide              1 - 35      signal peptide
45 - 60     repeat 1 (SEQ ID 149)       45 - 60     repeat 1 (SEQ ID 162)
61 - 76     repeat 2 (SEQ ID 150)       61 - 76     repeat 2 (SEQ ID 163)
77 - 92     repeat 3 (SEQ ID 151)       77 - 92     repeat 3 (SEQ ID 164)
93 - 108    repeat 4 (SEQ ID 152)       154 - 158   LPXTG motif
109 - 124   repeat 5 (SEQ ID 153)
125 - 140   repeat 6 (SEQ ID 154)
141 - 156   repeat 7 (SEQ ID 155)
157 - 172   repeat 8 (SEQ ID 156)
173 - 188   repeat 9 (SEQ ID 157)
189 - 204   repeat 10 (SEQ ID 158)
205 - 220   repeat 11 (SEQ ID 159)
221 - 236   repeat 12 (SEQ ID 160)
237 - 252   repeat 13 (SEQ ID 161)
314 - 318   LPXTG motif

Table 3:

FbsA of GBS strain O90R                 FbsA of GBS strain SSI 169
1 - 35      signal peptide              1 - 34      signal peptide
45 - 60     repeat 1 (SEQ ID 165)       45 - 60     repeat 1 (SEQ ID 175)
61 - 76     repeat 2 (SEQ ID 166)       61 - 76     repeat 2 (SEQ ID 176)
77 - 92     repeat 3 (SEQ ID 167)       77 - 92     repeat 3 (SEQ ID 177)
93 - 108    repeat 4 (SEQ ID 168)       93 - 108    repeat 4 (SEQ ID 178)
109 - 124   repeat 5 (SEQ ID 169)       109 - 124   repeat 5 (SEQ ID 179)
125 - 140   repeat 6 (SEQ ID 170)       125 - 140   repeat 6 (SEQ ID 180)
141 - 156   repeat 7 (SEQ ID 171)       141 - 156   repeat 7 (SEQ ID 181)
157 - 172   repeat 8 (SEQ ID 172)       157 - 172   repeat 8 (SEQ ID 182)
173 - 188   repeat 9 (SEQ ID 173)       173 - 188   repeat 9 (SEQ ID 183)
189 - 204   repeat 10 (SEQ ID 174)      189 - 204   repeat 10 (SEQ ID 184)
267 - 270   LPXTG motif                 205 - 220   repeat 11 (SEQ ID 185)
                                        221 - 236   repeat 12 (SEQ ID 186)
                                        237 - 252   repeat 13 (SEQ ID 187)
                                        253 - 268   repeat 14 (SEQ ID 188)
                                        269 - 284   repeat 15 (SEQ ID 189)
                                        285 - 300   repeat 16 (SEQ ID 190)
                                        301 - 316   repeat 17 (SEQ ID 191)
                                        317 - 332   repeat 18 (SEQ ID 192)
                                        333 - 348   repeat 19 (SEQ ID 193)
                                        349 - 364   repeat 20 (SEQ ID 194)
                                        365 - 380   repeat 21 (SEQ ID 195)
                                        381 - 396   repeat 22 (SEQ ID 196)
                                        397 - 412   repeat 23 (SEQ ID 197)
                                        413 - 428   repeat 24 (SEQ ID 198)
                                        429 - 444   repeat 25 (SEQ ID 199)
                                        445 - 460   repeat 26 (SEQ ID 200)
                                        461 - 476   repeat 27 (SEQ ID 201)
                                        477 - 492   repeat 28 (SEQ ID 202)
                                        493 - 508   repeat 29 (SEQ ID 203)
                                        509 - 524   repeat 30 (SEQ ID 204)
                                        586 - 590   LPXTG motif



As used herein "about" includes the particularly recited ranges larger or smaller by several, a 
few, 5, 4, 3, 2 or 1 amino acid at either extreme or at both extremes. 
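
The repeat positions listed in the tables above follow a regular pattern: repeat 1 starts at residue 45 and each repeat spans 16 residues. As a minimal sketch under that reading (the function names `repeat_range` and `repeat_ranges` are hypothetical and introduced only for this illustration), the residue range of any repeat may be computed as follows.

```python
from typing import List, Tuple

REPEAT_LENGTH = 16        # residues per repeat unit, as stated in the text
FIRST_REPEAT_START = 45   # first residue of repeat 1 in the tables


def repeat_range(n: int) -> Tuple[int, int]:
    """Return the (start, end) residue numbers, 1-based and inclusive, of repeat n."""
    if n < 1:
        raise ValueError("repeat numbers start at 1")
    start = FIRST_REPEAT_START + REPEAT_LENGTH * (n - 1)
    return start, start + REPEAT_LENGTH - 1


def repeat_ranges(count: int) -> List[Tuple[int, int]]:
    """Return the ranges of repeats 1..count, e.g. count=19 for strain 6313."""
    return [repeat_range(n) for n in range(1, count + 1)]
```

For instance, `repeat_range(19)` gives residues 333 to 348, matching the last repeat listed for strain 6313. The sketch does not model the signal peptide or the LPXTG motif, whose positions are taken from the tables as given.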

Preferred fragments of the invention include, for example, truncation polypeptides including 
polypeptides having an amino acid sequence set forth in the Sequence Listing, or of variants or 
derivatives thereof, except for deletion of a continuous series of residues (that is, a continuous
region, part or portion) that includes the amino terminus, or a continuous series of residues that
includes the carboxyl terminus or, as in double truncation mutants, deletion of two continuous
series of residues, one including the amino terminus and one including the carboxyl terminus.
Fragments having the size ranges set out above also are preferred embodiments of truncation 
fragments, which are especially preferred among fragments generally. Degradation forms of the 
polypeptides of the invention in a host cell are also preferred. 

Also preferred in this aspect of the invention are fragments characterized by structural or 
functional attributes of the polypeptide of the present invention. Preferred embodiments of the 
invention in this regard include fragments that comprise alpha-helix and alpha-helix-forming
regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and
coil-forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta
amphipathic regions, flexible regions, surface-forming regions, substrate binding regions, and
high antigenic index regions of the polypeptide of the present invention, and combinations of 
such fragments. 

Preferred regions are those that mediate activities of the polypeptide of the present invention. 
Most highly preferred in this regard are fragments that have a chemical, biological or other 
activity of the polypeptide of the present invention, including those with a similar activity or an 
improved activity, or with a decreased undesirable activity. Particularly preferred are fragments
comprising a receptor activity for, e.g., fibrinogen in the case of FbsA or the host cell in the case
of PabA, PabB, PabC and PabD, that confer a function essential for the ability of S. agalactiae to
cause disease in humans and/or that are able to mediate the adherence and/or invasion of S.
agalactiae to or into epithelial cells, more preferably human epithelial cells. Further preferred
polypeptide fragments are those that comprise or contain antigenic or immunogenic determinants
in an animal, especially in a human. A host cell as used herein is a cell which is capable of
taking up GBS in the natural host or in an internalization assay such as, e.g., the one
described in example 1.

The polypeptides according to the present invention may be used for the detection of the 
organism or organisms in a sample containing these polypeptides. Preferably such detection is 
for diagnosis, more preferably for the diagnosis of a disease, most preferably for the diagnosis of
a disease related or linked to the presence or abundance of Gram-positive bacteria, especially
bacteria selected from the group comprising streptococci, staphylococci and lactococci. More
preferably, the microorganisms are selected from the group comprising Streptococcus
agalactiae, Streptococcus pyogenes, Streptococcus pneumoniae and Streptococcus mutans.

The present invention also relates to diagnostic assays such as quantitative and diagnostic assays 
for detecting levels of the polypeptide of the present invention in cells and tissues, including 
determination of normal and abnormal levels. Thus, for instance, a diagnostic assay in 
accordance with the invention for detecting over-expression of the polypeptide compared to 
normal control tissue samples may be used to detect the presence of an infection, for example, 
and to identify the infecting organism. Assay techniques that can be used to determine levels of a 
polypeptide in a sample derived from a host are well-known to those of skill in the art. Such
assay methods include radioimmunoassays, competitive-binding assays, Western Blot analysis
and ELISA assays. Among these, ELISAs frequently are preferred. An ELISA assay initially 
comprises preparing an antibody specific to the polypeptide, preferably a monoclonal antibody. 
In addition, a reporter antibody generally is prepared which binds to the monoclonal antibody. 
The reporter antibody is attached to a detectable reagent such as a radioactive, fluorescent or
enzymatic reagent, for example horseradish peroxidase enzyme.

The polypeptides according to the present invention may also be used for the purpose of or in 
connection with an array. More particularly, at least one of the polypeptides according to the
present invention may be immobilized on a support. Said support typically comprises a variety of
polypeptides whereby the variety may be created by using one or several of the polypeptides 
according to the present invention and/or polypeptides being different therefrom. The 
characterizing feature of such array as well as of any array in general is the fact that at a distinct 
or predefined region or position on said support or a surface thereof, a distinct polypeptide is 
immobilized. Because of this any activity at a distinct position or region of an array can be 
correlated with a specific polypeptide. The number of different polypeptides immobilized on a 
support may range from as few as 10 to several thousand different polypeptides. The density of
polypeptides per cm² is in a preferred embodiment from as few as 10 polypeptides per cm² to at
least 400 different polypeptides per cm², and more particularly at least 1000 different
polypeptides per cm².

The manufacture of such arrays is known to the one skilled in the art and is, for example, described
in US patent 5,744,309. The array preferably comprises a planar, porous or non-porous solid
support having at least a first surface. The polypeptides as disclosed herein, are immobilized on 
said surface. Preferred support materials are, among others, glass or cellulose. It is also within 
the present invention that the array is used for any of the diagnostic applications described 
herein. Apart from the polypeptides according to the present invention also the nucleic acid
molecules according to the present invention may be used for the generation of an array as 
described above. This applies as well to an array made of antibodies, preferably monoclonal 
antibodies as, among others, described herein. 

The isolated nucleic acid molecule according to the present invention, also referred to herein as 
the nucleic acid (molecule) according to the present invention, codes for the amino acid motive 
and the polypeptides according to the present invention. The nucleic acid molecule according to 
the present invention can in a first alternative be a nucleic acid (molecule) which has an identity 
of at least 70 % to a nucleic acid molecule which has the nucleic acid sequence as specified in 
SEQ ID No. 1 to 10. It is also within the present invention that the isolated nucleic acid molecule
has a similarity of at least 70 % to any sequence which encodes any of the polypeptides of the
present invention. Preferably, the identity is at least 80 % and more preferably the identity is at 
least 90 %. Identity may also be 95%, 96 %, 97 %, 98 %, 99% or 99.5 %. 

Identity, as known in the art and used herein, is the relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, identity also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match
between strings of such sequences. Identity can be readily calculated (Computational Molecular
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics
and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis
of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey,
1994; Sequence Analysis in Molecular Biology, von Heijne, G., Academic Press, 1987; and
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York,
1991). While there exist a number of methods to measure identity between two polynucleotide or
two polypeptide sequences, the term is well known to skilled artisans (Sequence Analysis in
Molecular Biology, von Heijne, G., Academic Press, 1987; Sequence Analysis Primer, Gribskov,
M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carrillo, H., and Lipman, D.,
SIAM J. Applied Math., 48: 1073 (1988)). Preferred methods to determine identity are designed
to give the largest match between the sequences tested. Methods to determine identity are
codified in computer programs. Preferred computer program methods to determine identity
between two sequences include, but are not limited to, GCG program package (Devereux, J., et
al., Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S.F.
et al., J. Molec. Biol. 215: 403 (1990)).
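
For illustration, once two sequences have been aligned by one of the programs named above, a percent identity can be computed by counting matching columns. The following is a minimal sketch only: it does not perform the alignment itself, the function name `percent_identity` is hypothetical, and the choice of denominator (all columns except those where both sequences carry a gap) is an assumption, since different programs use different conventions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Columns in which both sequences carry a gap are ignored; every other
    column counts towards the denominator (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no residue in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the identity reaches the given percentage, e.g. the 70 % named in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Under these assumptions, two aligned sequences differing in one of four columns have an identity of 75 % and therefore meet the 70 % threshold but not the 80 % threshold.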

The nucleic acid according to the present invention can as a second alternative also be a nucleic 
acid which is at least essentially complementary to the nucleic acid described as the first 
alternative above. As used herein complementary means that a nucleic acid strand is base pairing 
via Watson-Crick base pairing with a second nucleic acid strand. Essentially complementary as 
used herein means that the base pairing is not occurring for all of the bases of the respective 
strands but leaves a certain number or percentage of the bases unpaired or wrongly paired. The 
percentage of correctly pairing bases is preferably at least 70 %, more preferably 80 %, even 
more preferably 90 % and most preferably any percentage higher than 90 %. It is to be noted that
a percentage of 70 % matching bases is considered as homology and the hybridization having
this extent of matching base pairs is considered as stringent. Hybridization conditions for this
kind of stringent hybridization may be taken from Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., 1987. More particularly, the hybridization conditions can be as follows: 



• Hybridization performed e.g. in 5 x SSPE, 5 x Denhardt's reagent, 0.1% SDS, 100 µg/mL
sheared DNA at 68°C

• Moderate stringency wash in 0.2 x SSC, 0.1% SDS at 42°C

• High stringency wash in 0.1 x SSC, 0.1% SDS at 68°C

Genomic DNA with a GC content of 50% has an approximate Tm of 96°C. For 1% mismatch,
the Tm is reduced by approximately 1°C.
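
The relationship between the percentage of correctly pairing bases and the Tm stated above can be sketched as follows. This is an illustration under simplifying assumptions, not a thermodynamic model: both strands are written 5' to 3', have equal length and contain no gaps or bulges, the 96°C starting value is the figure given above for genomic DNA with 50% GC content, and the names `percent_paired` and `estimated_tm` are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_paired(strand: str, other: str) -> float:
    """Percentage of Watson-Crick pairs when `other` is read antiparallel to `strand`."""
    if len(strand) != len(other) or not strand:
        raise ValueError("strands must be non-empty and of equal length")
    pairs = sum(
        COMPLEMENT.get(a) == b
        for a, b in zip(strand.upper(), reversed(other.upper()))
    )
    return 100.0 * pairs / len(strand)


def estimated_tm(paired_percent: float, perfect_tm: float = 96.0) -> float:
    """Apply the rule of thumb from the text: about 1 degree C lower per 1 % mismatch."""
    return perfect_tm - (100.0 - paired_percent)
```

Under these assumptions, a duplex with 90 % correctly pairing bases would have an estimated Tm of about 86°C, and one with 70 % about 66°C.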



In addition, any of the further hybridization conditions described herein are in principle
applicable as well.

The nucleic acid according to the present invention can as a third alternative also be a nucleic
acid which comprises a stretch of at least 15 bases of the nucleic acid according to the first and
second alternative of the nucleic acid molecule according to the present invention as outlined 
above. Preferably, the bases form a contiguous stretch of bases. However, it is also within the 
present invention that the stretch consists of two or more moieties which are separated by a 
number of bases. 
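
Whether a candidate nucleic acid comprises a contiguous stretch of at least 15 bases of a reference sequence can be checked, for illustration, by finding the longest run of bases common to both. The sketch below uses a standard longest-common-substring computation. The function names are hypothetical, only the contiguous case and a single strand are covered, and a check against the complementary strand (the second alternative) would have to be added separately.

```python
def longest_shared_stretch(query: str, reference: str) -> int:
    """Length of the longest contiguous run of bases common to both sequences."""
    query, reference = query.upper(), reference.upper()
    best = 0
    # previous[j] is the length of the common run ending at the previous
    # query base and at reference[j - 1]
    previous = [0] * (len(reference) + 1)
    for q in query:
        current = [0] * (len(reference) + 1)
        for j, r in enumerate(reference, start=1):
            if q == r:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def comprises_stretch(query: str, reference: str, minimum: int = 15) -> bool:
    """True if the query shares a contiguous stretch of at least `minimum` bases."""
    return longest_shared_stretch(query, reference) >= minimum
```

The running time grows with the product of the two sequence lengths, which is acceptable for individual genes but not for database-scale searches.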

The nucleic acid according to the present invention can as a fourth alternative also be a nucleic 
acid which anneals under stringent hybridisation conditions to any of the nucleic acids of the 
present invention according to the above outlined first, second, and third alternative. Stringent
hybridisation conditions are typically those described herein. 

Finally, the nucleic acid according to the present invention can as a fifth alternative also be a
nucleic acid which, but for the degeneracy of the genetic code, would hybridise to any of the
nucleic acids of the present invention according to the first,
second, third, and fourth alternative as outlined above. This kind of nucleic acid refers to the fact
that preferably the nucleic acids according to the present invention code for the polypeptides
according to the present invention and thus for adhesins and invasins, respectively. This kind of
nucleic acid is particularly useful in the detection and thus diagnosis of the nucleic acid
molecules according to the present invention and thus of the respective microorganisms such as
GBS and any disease or diseased condition where this kind of microorganism is involved.
Preferably, the hybridisation would occur or be performed under stringent conditions as
described in connection with the fourth alternative described above.

Polynucleotide(s) as used herein generally refer to any polyribonucleotide or
polydeoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA.
Thus, for instance, polynucleotides as used herein refer to, among others, single- and
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and
double-stranded RNA, RNA that is a mixture of single- and double-stranded regions, and hybrid
molecules comprising DNA and RNA that may be single-stranded or, more typically,
double-stranded, or triple-stranded, or a mixture of single- and double-stranded regions. In
addition, polynucleotide as used herein refers to triple-stranded regions comprising RNA or DNA
or both RNA and DNA. The strands in such regions may be from the same molecule or from
different molecules. The regions may include all of one or more of the molecules, but more
typically involve only a region of some of the molecules.
One of the molecules of a triple-helical region often is an oligonucleotide. As used herein, the 
term polynucleotide includes DNAs or RNAs as described above that contain one or more 
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons 
are "polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising 
unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two 
examples, are polynucleotides as the term is used herein. It will be appreciated that a great 
variety of modifications have been made to DNA and RNA that serve many useful purposes
known to those of skill in the art. The term polynucleotide as it is employed herein embraces 
such chemically, enzymatically or metabolically modified forms of polynucleotides, as well as 
the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and 
complex cells, inter alia. The term polynucleotide also embraces short polynucleotides often 
referred to as oligonucleotide(s). "Polynucleotide" and "nucleic acid" or "nucleic acid molecule" 
are often used interchangeably herein. 

Using the information provided herein and known, standard methods, such as those for cloning
and sequencing and those for synthesizing polynucleotides and polypeptides (see, e.g., Sambrook
et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY (1989)), one can generate numerous unique fragments, both longer and
shorter than the polynucleotides and polypeptides set forth in the Sequence Listing, of the S.
agalactiae genome and the S. agalactiae coding regions, which are encompassed by the present
invention. To be unique, a fragment must be of sufficient size to distinguish it from other known
nucleic acid sequences, most readily determined by comparing any selected S. agalactiae
fragment to the nucleotide sequences in computer databases such as GenBank. Such comparative
searches are standard in the art. Many unique fragments will be S. agalactiae-specific.
Typically, a unique fragment useful as a primer or probe will be at least about 20 to 25
nucleotides in length, depending upon the specific nucleotide content of the sequence.
Additionally, fragments can be, for example, at least about 30, 40, 50, 60, 75, 80, 90, 100, 150,
200, 250, 300, 400, 500 or more nucleotides in length. The nucleic acid fragment can be single,
double or triple stranded, depending upon the purpose for which it is intended.
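
The uniqueness criterion described above may be illustrated by a simple exact-match screen. This is a toy stand-in, offered as an assumption-laden sketch, for the database comparison mentioned in the text: it only looks for exact occurrences of the fragment or its reverse complement in a small in-memory list of background sequences, and the names `reverse_complement` and `is_unique_fragment` are hypothetical.

```python
from typing import Iterable


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def is_unique_fragment(
    fragment: str, background: Iterable[str], min_length: int = 20
) -> bool:
    """True if the fragment is long enough and absent from every background sequence.

    Both strands of each background sequence are considered. `min_length`
    defaults to 20, the lower bound the text gives for primers and probes.
    """
    fragment = fragment.upper()
    if len(fragment) < min_length:
        return False
    rc = reverse_complement(fragment)
    return not any(
        fragment in seq.upper() or rc in seq.upper() for seq in background
    )
```

A real screen would use a similarity search against a sequence database rather than exact string matching, since near-identical sequences can also cause cross-hybridisation.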

Additionally, as discussed above and below, modifications can be made to the S. agalactiae
polynucleotides and polypeptides that are encompassed by the present invention. For example,
nucleotide substitutions can be made which do not affect the polypeptide encoded by the nucleic
acid, and thus any polynucleotide which encodes the polypeptides of this invention is within the
present invention. Additionally, certain amino acid substitutions (and corresponding nucleotide
substitutions to encode them) can be made which are known in the art to be neutral (Robinson
W.E. Jr. and Mitchell, W.M., AIDS 4: S141-S162 (1990)). Such variations may arise naturally as
allelic variations (e.g. due to genetic polymorphism) or may be produced by human intervention
(e.g. by mutagenesis of cloned DNA sequences), such as induced point, deletion, insertion and
substitution mutations. Minor changes in amino acid sequences are generally preferred, such as 
conservative amino acid replacements, small internal deletions or insertions, and additions or 
deletions at the ends of the molecules. Substitutions may be designed based on, for example, the 
model of Dayhoff, et al. (in Atlas of Protein Sequence and Structure 1978, Nat'l Biomed. Res.
Found., Washington D.C.). These modifications can result in changes in the amino acid 
sequence, provide silent mutations, modify a restriction site, or provide other specific mutations. 
Likewise, such amino acid changes result in a different nucleic acid encoding the polypeptides 
and proteins. Thus, alternative polynucleotides, which are within the parameters of the present 
invention, are contemplated by such modifications. 

Furthermore, some of the polynucleotide sequences set forth in the Sequence Listing are open
reading frames (ORFs), i.e. coding regions of S. agalactiae. The polypeptide encoded by each
open reading frame can be deduced, and the molecular weight of the polypeptide thus calculated
using amino acid residue molecular weight values well known in the art. Any selected coding
region can be functionally linked, using standard techniques such as standard subcloning
techniques, to any desired regulatory sequence, whether a S. agalactiae regulatory sequence or a
heterologous regulatory sequence, or to a heterologous coding sequence to create a fusion
protein, as further described herein.
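
The molecular weight calculation mentioned above can be sketched as the sum of the average residue masses of the deduced polypeptide plus one water molecule for the free termini. The residue masses below are the commonly tabulated average values in Da, the function name `molecular_weight` is hypothetical, and post-translational modifications and signal peptide cleavage are not taken into account.

```python
# Average residue masses in Da (amino acid minus one water), one-letter code.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # added once for the free amino and carboxyl termini


def molecular_weight(polypeptide: str) -> float:
    """Average molecular weight in Da of an unmodified linear polypeptide."""
    polypeptide = polypeptide.upper()
    if not polypeptide:
        return 0.0
    return sum(RESIDUE_MASS[aa] for aa in polypeptide) + WATER
```

As a plausibility check, a single glycine residue gives about 75.07 Da, the molecular weight of free glycine. An unknown residue letter raises a `KeyError`, which is a deliberate choice in this sketch so that ambiguous sequence characters are not silently ignored.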

Polynucleotides of the present invention may be in the form of RNA, such as mRNA or cRNA,
or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or 
produced by chemical synthetic techniques or by a combination thereof. The DNA may be triple- 
stranded, double-stranded or single-stranded. Single-stranded DNA may be the coding strand, 
also known as the sense strand, or it may be the non-coding strand, also referred to as the
anti-sense strand.

The coding sequence which encodes a S. agalactiae polypeptide of this invention may be
identical to the coding sequence of a polynucleotide set forth in the sequence listing. It also may
be a polynucleotide with a different sequence which, as a result of the redundancy (degeneracy)
of the genetic code, encodes a S. agalactiae polypeptide set forth in the sequence listing.
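
The effect of the degeneracy of the genetic code can be illustrated by translating two different coding sequences and comparing the results. The sketch below uses the standard codon assignments and reads each sequence from its first base. The function names are hypothetical, and special handling of alternative bacterial start codons is omitted.

```python
from itertools import product
from math import prod

BASES = "TCAG"
# Standard codon assignments in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = coding_sequence.upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def encode_same_polypeptide(seq_a: str, seq_b: str) -> bool:
    """True if both coding sequences translate to the same polypeptide."""
    return translate(seq_a) == translate(seq_b)


def count_coding_sequences(polypeptide: str) -> int:
    """Number of distinct coding sequences (without stop codon) for a polypeptide."""
    codons_per_residue = {}
    for aa in CODON_TABLE.values():
        codons_per_residue[aa] = codons_per_residue.get(aa, 0) + 1
    return prod(codons_per_residue[aa] for aa in polypeptide.upper())
```

For example, ATGGCTAAA and ATGGCAAAG differ at two positions yet both encode Met-Ala-Lys, and that tripeptide can be encoded by 1 x 4 x 2 = 8 different sequences.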

Polynucleotides of the present invention which encode a S. agalactiae polypeptide as disclosed
herein, including those set forth in the sequence listing may include, but are not limited to, the 
coding sequence for a mature polypeptide, by itself; the coding sequence for a mature 
polypeptide and additional coding sequences, such as those encoding a leader or secretory 
sequence, such as a pre-, or pro- or prepro- protein sequence; the coding sequence of a mature 
polypeptide, with or without the aforementioned additional coding sequences, together with 
additional, non-coding sequences, including for example, but not limited to non-coding 5' and 3' 
sequences, such as the transcribed, non-translated sequences that play a role in transcription 
(including termination signals, for example), ribosome binding, mRNA stability elements, and 
additional coding sequence which encode additional amino acids, such as those which provide 
additional functionalities. Thus, for instance, a polypeptide may be fused to a marker sequence, 
such as a peptide, which facilitates purification of the fused polypeptide. In certain embodiments
of this aspect of the invention, the marker sequence is a hexa-histidine peptide, such as the tag
provided in the pQE vector (Qiagen, Inc.), among others, many of which are commercially
available. As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86: 821-824 (1989), for
instance, hexa-histidine provides for convenient purification of the fusion protein. The HA tag
may also be used to create fusion proteins and corresponds to an epitope derived from influenza
hemagglutinin protein, which has been described by Wilson et al., Cell 37:767 (1984), for
instance. Polynucleotides of the invention also include, but are not limited to, polynucleotides 
comprising a structural gene and its naturally associated genetic elements. 

In accordance with the foregoing, the term "polynucleotide encoding a polypeptide" as used 
herein encompasses polynucleotides which include a sequence encoding a polypeptide of the 
present invention, particularly a polypeptide having a S. agalactiae amino acid sequence set forth 
in the Sequence Listing. The term encompasses polynucleotides that include a single continuous 
region or discontinuous regions encoding the polypeptide (for example, interrupted by integrated 
phage or insertion sequence or editing) together with additional regions, that also may contain 
coding and/or non-coding sequences. 

The present invention further relates to variants of the herein above described polynucleotides 
which encode fragments, analogs and derivatives of the polypeptide having a deduced S. 
agalactiae amino acid sequence set forth in the Sequence Listing. A variant of the 
polynucleotide may be a naturally occurring variant such as a naturally occurring allelic variant, 
or it may be a variant that is not known to occur naturally. Such non-naturally occurring variants 
of the polynucleotide may be made by mutagenesis techniques, including those applied to 
polynucleotides, cells or organisms. 

Among variants in this regard are variants that differ from the aforementioned polynucleotides 
by nucleotide substitutions, deletions or additions. The substitutions, deletions or additions may 
involve one or more nucleotides. The variants may be altered in coding or non-coding regions or 
both. Alterations in the coding regions may produce conservative or non-conservative amino 
acid substitutions, deletions or additions. Preferred are polynucleotides encoding a variant, 
analog, derivative or fragment, or a variant, analogue or derivative of a fragment, which have a 
S. agalactiae sequence as set forth in the Sequence Listing, in which several, a few, 5 to 10, 1 to 
5, 1 to 3, 2, 1 or no amino acid(s) is substituted, deleted or added, in any combination. Especially 
preferred among these are silent substitutions, additions and deletions, which do not alter the 
properties and activities of the S. agalactiae polypeptides set forth in the Sequence Listing. Also 
especially preferred in this regard are conservative substitutions. 

Further preferred embodiments of the invention are polynucleotides that are at least 70 % 
identical over their entire length to a polynucleotide encoding a polypeptide according to the 
present invention and more particularly those polypeptides having an amino acid sequence set 
forth in the Sequence Listing, and polynucleotides which are complementary to such 
polynucleotides. Alternatively, most highly preferred are polynucleotides that comprise a region 
that is at least 80 % or at least 85 % identical over their entire length to a polynucleotide 
encoding a S. agalactiae polypeptide according to the present invention and more particularly 
those polypeptides set forth in the Sequence Listing, including complementary polynucleotides. 
In this regard, polynucleotides at least 90 %, 91 %, 92 %, 93 %, 94 %, 95 %, or 96 % identical 
over their entire length to the same are particularly preferred, and among these particularly 
preferred polynucleotides, those with at least 95 % are especially preferred. Furthermore, those with 
at least 97 % are highly preferred among those with at least 95 %, and among these, those with at 
least 98 % and at least 99 % are particularly highly preferred, with at least 99 % or 99.5 % being 
the more preferred. 
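
As a minimal sketch of how the identity thresholds above could be checked, the following assumes the two sequences have already been globally aligned over their entire length, with "-" marking gap positions. The alignment step itself, the function names and the example sequences are assumptions made for illustration and are not specified by the text.

```python
# Identity thresholds named in the text, highest first.
THRESHOLDS = (99.5, 99, 98, 97, 95, 90, 85, 80, 70)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same base in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"  # a gap never counts as a match
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity: float):
    """Return the strictest listed threshold that the identity satisfies, or None."""
    for threshold in THRESHOLDS:
        if identity >= threshold:
            return threshold
    return None


# Hypothetical 20-base sequences differing at one position.
reference = "ATGGCTAAAGGTCTGACCGA"
variant = "ATGGCCAAAGGTCTGACCGA"
identity = percent_identity(reference, variant)
print(identity, highest_threshold_met(identity))
```

Counting gap columns in the denominator is one of several conventions; other definitions of percent identity would give somewhat different values for gapped alignments.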

Preferred embodiments in this respect, moreover, are polynucleotides which encode polypeptides 
which retain substantially the same biological function or activity as the mature polypeptide 
encoded by the DNA set forth in the Sequence Listing. 

The present invention further relates to polynucleotides that hybridize to the herein above- 
described sequences. In this regard, the present invention especially relates to polynucleotides 
which hybridize under stringent conditions to the herein above-described polynucleotides. 
Stringent conditions are typically selective conditions. As herein used, the term "stringent 
conditions" means hybridization will occur only if there is at least 95 % and preferably at least 
97 % identity between the sequences. For a specific sequence, stringent conditions can be 
determined empirically according to the nucleotide content, as is known in the art and also 
exemplified herein. A typical example of stringent conditions is hybridization of a 
48mer having 55 % GC content at 42°C in 50 % formamide and 750 mM NaCl followed by 
washing at 55°C in 15 mM NaCl and 0.1 % SDS. 

As discussed additionally herein regarding polynucleotide assays of the invention, for instance, 
polynucleotides of the invention as discussed above, may be used as a hybridization probe for 
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding 
polypeptides of the present invention and to isolate cDNA and genomic clones of other genes 
that have a high sequence similarity to the polynucleotides of the present invention. Such probes 
generally will comprise at least 15 bases. Preferably, such probes will have at least 20, at least 25 
or at least 30 bases, and may have at least 50 bases. Particularly preferred probes will have at 
least 30 bases, and will have 50 bases or less, such as 30, 35, 40, 45, or 50 bases. 
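
The probe length ranges above can be expressed as a small check. This is a sketch only; the category labels, function names and the example probe are assumptions chosen for illustration, and GC content is included because it is the nucleotide-content figure quoted for the 48mer in the stringency example.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def probe_length_class(seq: str) -> str:
    """Map a probe length onto the ranges described in the text."""
    n = len(seq)
    if n < 15:
        return "too short"               # below the general minimum of 15 bases
    if 30 <= n <= 50:
        return "particularly preferred"  # at least 30 and 50 bases or less
    if n >= 20:
        return "preferred"               # at least 20 bases
    return "acceptable"                  # 15 to 19 bases


# Hypothetical 40-base probe.
probe = "ATGC" * 10
print(len(probe), probe_length_class(probe), gc_content(probe))
```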

For example, the coding region of the polynucleotide of the present invention may be isolated by 
screening using the known DNA sequence to synthesize an oligonucleotide probe. A labeled 
oligonucleotide having a sequence complementary to that of a gene of the present invention is 
then used to screen a library of cDNA, genomic DNA or mRNA to determine to which members 
of the library the probe hybridizes. 
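
The screening step can be sketched in silico as follows. This is a simplification under stated assumptions: hybridization is modelled as an exact match to the probe's reverse complement, whereas real hybridization tolerates some mismatches depending on stringency, and the clone names and sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def screen_library(probe: str, library: dict) -> list:
    """Names of library members to which the probe could anneal.

    Each member is given as one strand of a double-stranded insert, so the
    probe's target may lie on the listed strand or on its complement.
    """
    probe = probe.upper()
    target = reverse_complement(probe)
    hits = []
    for name, insert in library.items():
        insert = insert.upper()
        if target in insert or probe in insert:
            hits.append(name)
    return hits


# Hypothetical gene fragment and a probe complementary to it.
gene_fragment = "GATTACAGGCTTAACCGGTA"
probe = reverse_complement(gene_fragment)

library = {
    "clone_1": "CCC" + gene_fragment + "TTT",   # fragment in forward orientation
    "clone_2": "ACGTACGTACGTACGTACGTACGT",      # unrelated insert
    "clone_3": "AAA" + probe + "GGG",           # fragment in reverse orientation
}
hits = screen_library(probe, library)
print(hits)
```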

The polynucleotides and polypeptides of the present invention may be employed as reagents and 
materials for development of treatments of and diagnostics for disease, particularly human 
disease, as further discussed herein relating to polynucleotide assays, inter alia. 

The polynucleotides of the present invention that are oligonucleotides can be used in the 
processes herein as described, but preferably for PCR, to determine whether or not the S. 
agalactiae genes identified herein in whole or in part are present and/or transcribed in infected 
tissue such as blood. It is recognized that such sequences will also have utility in diagnosis of the 
stage of infection and type of infection the pathogen has attained. For this and other purposes the 
arrays comprising at least one of the nucleic acids according to the present invention as described 
herein, may be used. 

The polynucleotides may encode a polypeptide which is the mature protein plus additional amino 
or carboxyl-terminal amino acids, or amino acids interior to the mature polypeptide (when the 
mature form has more than one polypeptide chain, for instance). Such sequences may play a role 
in processing of a protein from precursor to a mature form, may allow protein transport, may 
lengthen or shorten protein half-life or may facilitate manipulation of a protein for assay or 
production, among other things. As generally is the case in vivo, the additional amino acids may 
be processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed such 
inactive precursors generally are activated. Some or all of the prosequences may be removed 
before activation. Generally, such precursors are called proproteins. 

The present invention additionally contemplates polynucleotides functionally encoding fusion 
polypeptides wherein the fusion polypeptide comprises a fragment of a S. agalactiae polypeptide 
and one or more polypeptide(s) derived from another S. agalactiae polypeptide or from another 
organism or a synthetic polyamino acid sequence. Such polynucleotides may or may not encode 
amino acid sequences to facilitate cleavage of the S. agalactiae polypeptide from the other 
polypeptide(s) under appropriate conditions. 

In sum, a polynucleotide of the present invention may preferably encode a mature protein, a 
mature protein plus a leader sequence (which may be referred to as a preprotein), a precursor of a 
mature protein having one or more prosequences which are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader sequence and 
one or more prosequences, which generally are removed during processing steps that produce 
active and mature forms of the polypeptide. 
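
The four forms named in this summary differ only in whether a leader sequence and one or more prosequences are attached to the mature protein. A minimal sketch of that naming scheme, with a hypothetical function name and arguments, is:

```python
def precursor_form(has_leader: bool, prosequence_count: int) -> str:
    """Name the encoded form from the extra sequences attached to the mature protein."""
    if prosequence_count < 0:
        raise ValueError("prosequence_count cannot be negative")
    if has_leader and prosequence_count > 0:
        return "preproprotein"   # leader sequence plus one or more prosequences
    if has_leader:
        return "preprotein"      # mature protein plus a leader sequence
    if prosequence_count > 0:
        return "proprotein"      # prosequences only, no leader sequence
    return "mature protein"


for leader, pros in [(False, 0), (True, 0), (False, 2), (True, 1)]:
    print(leader, pros, precursor_form(leader, pros))
```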

Isolated as used herein means separated "by the hand of man" from its natural state; i.e., that, if it 
occurs in nature, it has been changed or removed from its original environment, or both. For 
example, a naturally occurring polynucleotide or a polypeptide naturally present in a living 
organism in its natural state is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is "isolated", as the term is employed 
herein. As part of or following isolation, such polynucleotides can be joined to other 
polynucleotides, such as DNAs, for mutagenesis, to form fusion proteins, and for propagation or 
expression in a host, for instance. The isolated polynucleotides, alone or joined to other 
polynucleotides such as vectors, can be introduced into host cells, in culture or in whole 
organisms. Introduced into host cells in culture or in whole organisms, such DNAs still would be 
isolated, as the term is used herein, because they would not be in their naturally occurring form 
or environment. Similarly, the polynucleotides and polypeptides may occur in a composition, 
such as media formulations, solutions for introduction of polynucleotides or polypeptides, for 
example, into cells, compositions or solutions for chemical or enzymatic reactions, for instance, 
which are not naturally occurring compositions, and therein remain isolated polynucleotides or 
polypeptides within the meaning of that term as it is employed herein. 

The nucleic acids according to the present invention may be chemically synthesized. 
Alternatively, the nucleic acids can be isolated from various microorganisms by methods known 
to the one skilled in the art. Appropriate sources are, e.g., Streptococcus agalactiae, 
Streptococcus pyogenes, Streptococcus mutans and Streptococcus pneumoniae. 

The nucleic acids according to the present invention may be used for the detection of nucleic 
acids and organisms or samples containing these nucleic acids. Preferably such detection is for 
diagnosis, more preferably for the diagnosis of a disease, most preferably for the diagnosis of a 
disease related or linked to the presence or abundance of S. agalactiae. 

S. agalactiae bacteria, which have infected eukaryotes (herein also "individual(s)"), particularly 
mammals, and especially humans, may be detected at the DNA level by a variety of techniques. 
By selecting regions of nucleic acids that vary among strains of S. agalactiae, preferred 
candidates for distinguishing a specific strain of S. agalactiae can be obtained. Furthermore, by 
selecting regions of nucleic acids that vary between S. agalactiae and other organisms, preferred 
candidates for distinguishing S. agalactiae from other organisms can be obtained. Nucleic acids 
for diagnosis may be obtained from an infected individual's cells and tissues, such as bone, 
blood, muscle, cartilage, and skin. Genomic DNA may be used directly for detection or may be 
amplified enzymatically by using PCR (Saiki et al., Nature, 324: 163-166 (1986)) prior to 
analysis. RNA or cDNA may also be used in the same ways. As an example, PCR primers 
complementary to the nucleic acid forming part of the polynucleotide of the present invention 
can be used to identify and analyze for its presence and/or expression. Using PCR, 
characterization of the strain of S. agalactiae present in a mammal, and especially a human, may 
be made by an analysis of the genotype of the prokaryote gene. For example, deletions and 
insertions can be detected by a change in size of the amplified product in comparison to the 
genotype of a reference sequence. Point mutations can be identified by hybridising amplified 
DNA to radiolabeled RNA or alternatively, radiolabeled antisense DNA sequences. Perfectly 
matched sequences can be distinguished from mismatched duplexes by RNase A digestion or by 
differences in melting temperatures. 
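
The size comparison of amplified products can be sketched in silico. The example below assumes exact primer matches on a single listed strand, and the primers and templates are hypothetical; it is meant only to show how a deletion shortens the amplicon relative to a reference.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template: str, forward_primer: str, reverse_primer: str):
    """Length of the product spanning both primer sites, or None if a site is absent."""
    template = template.upper()
    start = template.find(forward_primer.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the listed strand's complement, so its
    # site appears on the listed strand as its reverse complement.
    reverse_site = reverse_complement(reverse_primer)
    end = template.find(reverse_site, start + len(forward_primer))
    if end == -1:
        return None
    return end + len(reverse_site) - start


def size_change(reference_len: int, sample_len: int) -> str:
    """Describe the difference between sample and reference amplicon sizes."""
    diff = sample_len - reference_len
    if diff < 0:
        return f"deletion of {-diff} bp"
    if diff > 0:
        return f"insertion of {diff} bp"
    return "no size change"


# Hypothetical primers and templates; the isolate lacks 4 bases between the sites.
FORWARD = "ATGGCTAAAGG"
REVERSE = reverse_complement("CCTGACCGATT")
reference = "TT" + FORWARD + "ACGTACGTACGT" + "CCTGACCGATT" + "AA"
isolate = "TT" + FORWARD + "ACGTACGT" + "CCTGACCGATT" + "AA"

ref_len = amplicon_length(reference, FORWARD, REVERSE)
iso_len = amplicon_length(isolate, FORWARD, REVERSE)
print(ref_len, iso_len, size_change(ref_len, iso_len))
```

A point mutation leaves the amplicon size unchanged, which is why the text turns to hybridization-based methods for that case.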

Sequence differences between a reference gene and genes having mutations also may be revealed 
by direct DNA sequencing. In addition, cloned DNA segments may be employed as probes to 
detect specific DNA segments. The sensitivity of such methods can be greatly enhanced by 
appropriate use of PCR or another amplification method. For example, a sequencing primer can 
be used with double-stranded PCR product or a single-stranded template molecule generated by 
a modified PCR. The sequence determination is performed by conventional procedures with 
radiolabeled nucleotides or by automatic sequencing procedures with fluorescent tags. 

Genetic characterization based on DNA sequence differences may be achieved by detection of 
alteration in electrophoretic mobility of DNA fragments in gels, with or without denaturing 
agents. Small sequence deletions and insertions can be visualised by high resolution gel 
electrophoresis. DNA fragments of different sequences may be distinguished on denaturing 
formamide gradient gels in which the mobilities of different DNA fragments are retarded in the 
gel at different positions according to their specific melting or partial melting temperatures (see, 
e.g., Myers et al., Science, 230: 1242 (1985)). 

Sequence changes at specific locations also may be revealed by nuclease protection assays, such 
as RNase and S1 protection or the chemical cleavage method (e.g., Cotton et al., Proc. Natl. 
Acad. Sci. USA, 85: 4397-4401 (1985)). 

Thus, the detection of a specific DNA sequence may be achieved by methods such as 
hybridization, RNase protection, chemical cleavage, direct DNA sequencing or the use of 
restriction enzymes, e.g., restriction fragment length polymorphisms (RFLP) and Southern 
blotting of genomic DNA. 
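
The RFLP principle can be sketched as follows. The example uses the EcoRI recognition site GAATTC, cut after the first G, on a linear molecule viewed as a single strand; the two sequences are hypothetical, and the variant carries a point mutation that abolishes one site.

```python
def digest(seq: str, site: str = "GAATTC", cut_offset: int = 1) -> list:
    """Fragment lengths after cutting a linear sequence at every occurrence of site.

    cut_offset is the number of bases of the site left on the upstream fragment
    (1 for EcoRI, which cuts G^AATTC).
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Hypothetical 47-base reference with two EcoRI sites.
reference = "ACGTACGTAC" + "GAATTC" + "CCGGTTAACCGGTTAACCGG" + "GAATTC" + "TTAGC"
# Variant with a single base change (A to G) that destroys the second site.
variant = "ACGTACGTAC" + "GAATTC" + "CCGGTTAACCGGTTAACCGG" + "GAGTTC" + "TTAGC"

print(digest(reference))
print(digest(variant))
```

The reference yields three fragments and the variant two, so the two genotypes would give different band patterns on a gel or Southern blot.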

In addition to more conventional gel-electrophoresis and DNA sequencing, mutations also can be 
detected by in situ analysis. 

Cells carrying mutations or polymorphisms in the gene of the present invention may also be 
detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For 
example, RT-PCR can be used to detect mutations. It is particularly preferred to use RT-PCR in 
conjunction with automated detection systems, such as, for example, GeneScan. RNA or cDNA 
may also be used for the same purpose, by PCR or RT-PCR. As an example, PCR primers 
complementary to the nucleic acid encoding the polypeptide of the present invention can be used 
to identify and analyse mutations. The primers may be used to amplify the gene isolated from the 
individual such that the gene may then be subject to various techniques for elucidation of the 
DNA sequence. In this way, mutations in the DNA sequence may be diagnosed. 

The invention provides a process for diagnosing disease, arising from infection with S. 
agalactiae, comprising determining from a sample isolated or derived from an individual an 
increased level of expression of a polynucleotide having the sequence of a polynucleotide set 
forth in the Sequence Listing. Expression of polynucleotide can be measured using any one of 
the methods well known in the art for the quantitation of polynucleotides, such as, for example, 
PCR, RT-PCR, RNase protection, Northern blotting, other hybridisation methods and the arrays 
described herein. 

The present invention also relates to vectors which comprise a polynucleotide or polynucleotides 
of the present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

Cells can be genetically engineered to incorporate polynucleotides and express polypeptides of 
the present invention. Introduction of polynucleotides into the host cell can be effected by 
calcium phosphate transfection, DEAE-dextran mediated transfection, transvection, 
microinjection, cationic lipid-mediated transfection, electroporation, transduction, scrape 
loading, ballistic introduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et al., BASIC METHODS IN MOLECULAR 
BIOLOGY, (1986) and Sambrook et al., MOLECULAR CLONING: A LABORATORY 
MANUAL, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). 

Polynucleotide constructs in cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention 
can be synthetically produced by conventional peptide synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to produce 
such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd 
Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). 

In accordance with this aspect of the invention the vector may be, for example, a plasmid vector, 
a single or double-stranded phage vector, or a single or double-stranded RNA or DNA viral 
vector. Plasmids generally are designated herein 
by a lower case p preceded and/or followed by capital letters and/or numbers, in accordance with 
standard naming conventions that are familiar to those of skill in the art. Starting plasmids 
disclosed herein are either commercially available, publicly available, or can be constructed from 
available plasmids by routine application of well known, published procedures, given the 
teachings herein. Many plasmids and other cloning and expression vectors that can be used in 
accordance with the present invention are well known and readily available to those of skill in 
the art. 

Preferred among vectors, in certain respects, are those for expression of polynucleotides and 
polypeptides of the present invention. Generally, such vectors comprise cis-acting control 
regions effective for expression in a host operatively linked to the polynucleotide to be 
expressed. Appropriate trans-acting factors either are supplied by the host, supplied by a 
complementing vector or supplied by the vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for specific expression. Such 
specific expression may be inducible expression or expression only in certain types of cells or 
both inducible and cell-specific. Particularly preferred among inducible vectors are vectors that 
can be induced for expression by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. A variety of vectors suitable to this aspect of the invention, 
including constitutive and inducible expression vectors for use in prokaryotic and eukaryotic 
cells, are well known and employed routinely by those of skill in the art. 

A great variety of expression vectors can be used to express a polypeptide of the invention. Such 
vectors include, among others, chromosomal, episomal and virus-derived vectors, e.g., vectors 
derived from bacterial plasmids, from bacteriophage, from transposons, from yeast episomes, 
from insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, 
papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies 
viruses and retroviruses, and vectors derived from combinations thereof, such as those derived 
from plasmid and bacteriophage genetic elements, such as cosmids and phagemids; all may be 
used for expression in accordance with this aspect of the present invention. Generally, any vector 
suitable to maintain, propagate or express polynucleotides to express a polypeptide in a host may 
be used for expression in this regard. 

The appropriate DNA sequence may be inserted into the vector by any of a variety of well- 
known and routine techniques, such as, for example, those set forth in Sambrook et al., 
MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). 

The DNA sequence in the expression vector is operatively linked to appropriate expression 
control sequence(s), including, for instance, a promoter to direct mRNA transcription. 
Representatives of such promoters include, but are not limited to, the phage lambda PL 
promoter, the E. coli lac, trp and tac promoters, the SV40 early and late promoters and promoters 
of retroviral LTRs. 

In general, expression constructs will contain sites for transcription initiation and termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will include a translation initiating AUG or others 
such as GUG and UUG at the beginning and a termination codon appropriately positioned at the 
end of the polypeptide to be translated. 
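
The layout of the coding portion described above (an initiating AUG, GUG or UUG followed in frame by a termination codon) can be sketched as a simple scan. The transcript below is hypothetical, and the function returns only the first start codon that has an in-frame stop; it does not model ribosome binding site recognition.

```python
START_CODONS = {"AUG", "GUG", "UUG"}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def first_orf(mrna: str):
    """Return (start, end) of the first start codon followed by an in-frame stop.

    end is exclusive and includes the stop codon; None is returned if no
    such open reading frame exists.
    """
    mrna = mrna.upper().replace("T", "U")  # accept DNA or RNA letters
    for start in range(len(mrna) - 2):
        if mrna[start:start + 3] not in START_CODONS:
            continue
        for pos in range(start + 3, len(mrna) - 2, 3):
            if mrna[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3
    return None


# Hypothetical transcript: 12-base leader, GUG start, three codons, UAA stop.
transcript = "GGAGGAACAGCU" + "GUG" + "AAAGCUCUG" + "UAA" + "CC"
orf = first_orf(transcript)
print(orf, transcript[orf[0]:orf[1]])
```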

In addition, the constructs may contain control regions that regulate as well as engender 
expression. Generally, in accordance with many commonly practiced procedures, such regions 
will operate by controlling transcription, such as transcription factors, repressor binding sites and 
termination, among others. 

Vectors for propagation and expression generally will include selectable markers and 
amplification regions, such as, for example, those set forth in Sambrook et al., MOLECULAR 
CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York (1989). 

Representative examples of appropriate cells which host said vectors include bacterial cells, such 
as streptococci, staphylococci, E. coli, streptomyces and Bacillus subtilis cells; fungal cells, such 
as yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; 
animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and Bowes melanoma cells; and 
plant cells. 

The following vectors, which are commercially available, are provided by way of example. 
Among vectors preferred for use in bacteria are pQE70, pQE60 and pQE-9, available from 
Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A, available from Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 
available from Pharmacia, and pBR322 (ATCC 37017). Among preferred eukaryotic vectors are 
pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV, 
pMSG and pSVL available from Pharmacia. These vectors are listed solely by way of illustration 
of the many commercially available and well known vectors that are available to those of skill in 
the art for use in accordance with this aspect of the present invention. It will be appreciated that 
any other plasmid or vector suitable for, for example, introduction, maintenance, propagation or 
expression of a polynucleotide or polypeptide of the invention in a host may be used in this 
aspect of the invention. 

Promoter regions can be selected from any desired gene using vectors that contain a reporter 
transcription unit lacking a promoter region, such as a chloramphenicol acetyl transferase 
("CAT") transcription unit, downstream of a restriction site or sites for introducing a candidate 
promoter fragment; i.e., a fragment that may contain a promoter. As is well known, introduction 
into the vector of a promoter-containing fragment at the restriction site upstream of the cat gene 
engenders production of CAT activity, which can be detected by standard CAT assays. Vectors 
suitable to this end are well known and readily available, such as pKK232-8 and pCM7. 
Promoters for expression of polynucleotides of the present invention include not only well 
known and readily available promoters, but also promoters that readily may be obtained by the 
foregoing technique, using a reporter gene. 

Among known prokaryotic promoters suitable for expression of polynucleotides and 
polypeptides in accordance with the present invention are the E. coli lacI and lacZ 
promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR, PL promoters and the trp 
promoter. 

Among known eukaryotic promoters suitable in this regard are the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs, such as those of the Rous sarcoma virus ("RSV"), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. 

Recombinant expression vectors will include, for example, origins of replication, a promoter 
preferably derived from a highly-expressed gene to direct transcription of a downstream 
structural sequence, and a selectable marker to permit isolation of vector containing cells after 
exposure to the vector. 

Polynucleotides of the invention, encoding the heterologous structural sequence of a polypeptide 
of the invention, generally will be inserted into the vector using standard techniques so that it is 
operably linked to the promoter for expression. The polynucleotide will be positioned so that the 
transcription start site will be 5' to the AUG that initiates translation of the polypeptide to be 
expressed. Generally, there will be no other open reading frames that begin with an initiation 
codon, usually AUG, and lie between the ribosome binding site and the initiation codon. Also, 
generally, there will be a translation stop codon at the end of the polypeptide and there will be a 
polyadenylation signal in constructs for use in eukaryotic hosts. A transcription termination 
signal appropriately disposed at the 3' end of the transcribed region may also be included in the 
polynucleotide construct. 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
periplasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the expressed polypeptide. 

These signals may be endogenous to the polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may include 
not only secretion signals but also additional heterologous functional regions. Thus, for instance, 
a region of additional amino acids, particularly charged amino acids, may be added to the N- or 
C-terminus of the polypeptide to improve stability and persistence in the host cell, during 
purification or during subsequent handling and storage. Also, regions may be added to the 
polypeptide to facilitate purification. Such regions may be removed prior to final preparation of 
the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability or to facilitate purification, among others, are familiar and routine 
techniques in the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize or purify polypeptides. For example, EP-A-0 464 
533 (Canadian counterpart 2045869) discloses fusion proteins comprising various portions of 
constant region of immunoglobulin molecules together with another protein or part thereof. In drug 
discovery, for example, proteins have been fused with antibody Fc portions for the purpose of 
high-throughput screening assays to identify antagonists. See, D. Bennett et al., Journal of 
Molecular Recognition, Vol. 8, 52-58 (1995) and K. Johanson et al., The Journal of Biological 
Chemistry, Vol. 270, No. 16, pp 9459-9471 (1995). 

Cells typically then are harvested by centrifugation, disrupted by physical or chemical means, 
and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any convenient method, 
including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents; 
such methods are well known to those skilled in the art. 

Mammalian expression vectors may comprise expression sequences, such as an origin of 
replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenylation regions, splice donor and acceptor sites, transcriptional termination sequences, 
and 5' flanking non-transcribed sequences that are useful or necessary for expression. 

The polypeptide can be recovered and purified from recombinant cell cultures by well-known 
methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography is employed for 
purification. Well-known techniques for refolding protein may be employed to regenerate the 
active conformation when the polypeptide is denatured during isolation and/or purification. 

The polypeptides according to the present invention can be produced by chemical synthesis as 
well as by biotechnological means. The latter comprise the transfection or transformation of a 
host cell with a vector containing a nucleic acid according to the present invention and the 
cultivation of the transfected or transformed host cell under conditions which are known to 
those skilled in the art. The production method may also comprise a purification step in order to 
purify or isolate the polypeptide to be manufactured. In a preferred embodiment the vector is a 
vector according to the present invention. 

In a further aspect the present invention relates to an antibody directed to any of the 
polypeptides, derivatives or fragments thereof according to the present invention. The present 
invention includes, for example, monoclonal and polyclonal antibodies, chimeric, single chain, 
and humanized antibodies, as well as Fab fragments, or the product of a Fab expression library. 
It is within the present invention that the antibody may be chimeric, i.e., that different parts 
thereof stem from different species or at least the respective sequences are taken from different 
species. 

Antibodies generated against the polypeptides corresponding to a sequence of the present 
invention can be obtained by direct injection of the polypeptides into an animal or by 
administering the polypeptides to an animal, preferably a non-human. The antibody so obtained 
will then bind the polypeptides themselves. In this manner, even a sequence encoding only a fragment 
of the polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
polypeptide. 

For preparation of monoclonal antibodies, any technique known in the art which provides 
antibodies produced by continuous cell line cultures can be used. Examples include various 
techniques, such as those in Kohler, G. and Milstein, C., Nature 256: 495-497 (1975); Kozbor et 
al., Immunology Today 4: 72 (1983); Cole et al., pg. 77-96 in MONOCLONAL ANTIBODIES 
AND CANCER THERAPY, Alan R. Liss, Inc. (1985); U.S. Patent No. 5,545,403; U.S. Patent No. 
5,545,405; U.S. Patent No. 5,654,403; U.S. Patent No. 5,792,838; U.S. Patent No. 5,316,938; 
U.S. Patent No. 5,633,162; U.S. Patent No. 5,644,036; U.S. Patent No. 5,858,725. 

Techniques described for the production of single chain antibodies (U.S. Patent No. 4,946,778) 
can be adapted to produce single chain antibodies to immunogenic polypeptide products of this 
invention. Also, transgenic mice, or other organisms such as other mammals, may be used to 
express humanized antibodies to immunogenic polypeptide products of this invention. 

Alternatively, phage display technology could be utilized to select antibody genes with binding 
activities towards the polypeptide either from repertoires of PCR amplified v-genes of 
lymphocytes from humans screened for possessing anti-Fab or from naive libraries (McCafferty, 
J. et al., (1990), Nature 348, 552-554; Marks, J. et al., (1992) Biotechnology 10, 779-783). The 
affinity of these antibodies can also be improved by chain shuffling (Clackson, T. et al., (1991) 
Nature 352, 624-628). 

If two antigen binding domains are present, each domain may be directed against a different 
epitope - termed 'bispecific' antibodies. 

The above-described antibodies may be employed to isolate or to identify clones expressing the 
polypeptide or purify the polypeptide of the present invention by attachment of the antibody to a 
solid support for isolation and/or purification by affinity chromatography. 

Thus, among others, antibodies against the polypeptide of the present invention may be 
employed to inhibit and/or treat infections, particularly bacterial infections and especially 
infections arising from S. agalactiae. 

Polypeptide derivatives include antigenically, epitopically or immunologically equivalent 
derivatives which form a particular aspect of this invention. The term "antigenically equivalent 
derivative" as used herein encompasses a polypeptide or its equivalent which will be specifically 
recognized by certain antibodies which, when raised to the protein or polypeptide according to 
the present invention, interfere with the immediate physical interaction between pathogen and 
mammalian host. The term "immunologically equivalent derivative" as used herein encompasses 
a peptide or its equivalent which, when used in a suitable formulation to raise antibodies in a 
vertebrate, yields antibodies that act to interfere with the immediate physical interaction between 
pathogen and mammalian host. 

The polypeptide, such as an antigenically or immunologically equivalent derivative or a fusion 
protein thereof can be used as an antigen to immunize a mouse or other animal such as a rat or 
chicken. The fusion protein may provide stability to the polypeptide. The antigen may be 
associated, for example by conjugation, with an immunogenic carrier protein, for example 
bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). Alternatively, a multiple 
antigenic peptide comprising multiple copies of the protein or polypeptide, or an antigenically or 
immunologically equivalent polypeptide thereof, may be sufficiently antigenic to improve 
immunogenicity so as to obviate the use of a carrier. 

Preferably the antibody or derivative thereof is modified to make it less immunogenic in the 
individual. For example, if the individual is human the antibody may most preferably be 
"humanized", wherein the complementarity determining region(s) of the hybridoma-derived 
antibody has been transplanted into a human monoclonal antibody, for example as described in 
Jones, P. et al. (1986), Nature 321, 522-525 or Tempest et al., (1991) Biotechnology 9, 266-273. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a 
suitable delivery method such as direct injection of plasmid DNA into muscle (Wolff et al., 
(1992) Hum. Mol. Genet. 1, 363; Manthorpe et al., (1993) Hum. Gene Ther. 4, 419), delivery of 
DNA complexed with specific protein carriers (Wu et al., (1989) J Biol. Chem. 264, 16985), 
coprecipitation of DNA with calcium phosphate (Benvenisty & Reshef (1986) PNAS 83, 9551), 
encapsulation of DNA in various forms of liposomes (Kaneda et al., (1989) Science 243, 375), 
particle bombardment (Tang et al., (1992) Nature 356, 152; Eisenbraun et al., (1993) DNA Cell. 
Biol. 12, 791) and in vivo infection using cloned retroviral vectors (Seeger et al., (1984) PNAS 
81, 5849). 

In a further aspect the present invention relates to a peptide binding to any of the polypeptides 
according to the present invention, and a method for the manufacture of such peptides whereby 
the method is characterized by the use of the polypeptides according to the present invention and 
the basic steps are known to the one skilled in the art. 

Such peptides may be generated by using methods according to the state of the art such as phage 
display or ribosome display. In case of phage display, basically a library of peptides is generated, 
such as in the form of phages, and this kind of library is contacted with the target molecule, in the 
present case the polypeptides according to the present invention. Those peptides binding to the 
target molecule are subsequently removed, preferably as a complex with the target molecule, 
from the respective reaction. It is known to the one skilled in the art that the binding 
characteristics, at least to a certain extent, depend on the particularly realized experimental 
set-up such as the salt concentration and the like. After separating those peptides binding to the 
target molecule with a higher affinity or a bigger force, from the non-binding members of the 
library, and optionally also after removal of the target molecule from the complex of target 
molecule and peptide, the respective peptide(s) may subsequently be characterised. Prior to the 
characterisation optionally an amplification step is realized such as, e. g. by propagating the 
peptide coding phages. The characterisation preferably comprises the sequencing of the target 
binding peptides. Basically, the peptides are not limited in their lengths, however, 
peptides having a length from about 8 to 20 amino acids are preferably obtained in the 
respective methods. The size of the libraries may be about 10^ to 10^^, preferably 10* to 10^^ 
different peptides, however, is not limited thereto. 

A particular form of target binding polypeptides are the so-called "anticalines" which are, among 
others, described in German patent application DE 197 42 706. 

In a further aspect the present invention relates to functional nucleic acids interacting with any of 
the polypeptides according to the present invention, and a method for the manufacture of such 
functional nucleic acids whereby the method is characterized by the use of the polypeptides 
according to the present invention and the basic steps are known to the one skilled in the art. 
The functional nucleic acids are preferably aptamers and spiegelmers. 

Aptamers are D-nucleic acids, which are either single stranded or double stranded and which 
specifically interact with a target molecule. The manufacture or selection of aptamers is, e. g., 
described in European patent EP 0 533 838. Basically the following steps are realized. First, a 
mixture of nucleic acids, i. e. potential aptamers, is provided whereby each nucleic acid typically 
comprises a segment of several, preferably at least eight subsequent randomised nucleotides. 
This mixture is subsequently contacted with the target molecule whereby the nucleic acid(s) 
binds to the target molecule, such as based on an increased affinity towards the target or with a 
bigger force thereto, compared to the candidate mixture. The binding nucleic acid(s) are/is 
subsequently separated from the remainder of the mixture. Optionally, the thus obtained nucleic 
acid(s) is amplified using, e.g. polymerase chain reaction. These steps may be repeated several 
times giving at the end a mixture having an increased ratio of nucleic acids specifically binding 
to the target from which the final binding nucleic acid is then optionally selected. These 
specifically binding nucleic acid(s) are referred to as aptamers. It is obvious that at any stage of 
the method for the generation or identification of the aptamers samples of the mixture of 
individual nucleic acids may be taken to determine the sequence thereof using standard 
techniques. It is within the present invention that the aptamers may be stabilized such as, e. g., by 
introducing defined chemical groups which are known to the one skilled in the art of generating 
aptamers. Such modification may for example reside in the introduction of an amino group at the 
2'-position of the sugar moiety of the nucleotides. Aptamers are currently used as therapeutic 
agents. However, it is also within the present invention that the thus selected or generated 
aptamers may be used for target validation and/or as lead substance for the development of 
medicaments, preferably of medicaments based on small molecules. This is actually done by a 
competition assay whereby the specific interaction between the target molecule and the aptamer 
is inhibited by a candidate drug whereby upon replacement of the aptamer from the complex of 
target and aptamer it may be assumed that the respective drug candidate allows a specific 
inhibition of the interaction between target and aptamer, and if the interaction is specific, said 
candidate drug will, at least in principle, be suitable to block the target and thus decrease its 
biological availability or activity in a respective system comprising such target. The thus 
obtained small molecule may then be subject to further derivatisation and modification to 
optimise its physical, chemical, biological and/or medical characteristics such as toxicity, 
specificity, biodegradability and bioavailability. 
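
The repeated contact, separation and amplification steps described above for the selection of aptamers may be illustrated by the following minimal sketch. It is a simplified expected-value model only and not the selection method itself; the function name `enrich` and the retention probabilities are hypothetical values chosen for illustration and are not taken from the present description.

```python
def enrich(binder_fraction, p_keep_binder, p_keep_other, rounds):
    """Track the fraction of target-binding nucleic acids over selection rounds.

    binder_fraction: starting fraction of binding sequences in the mixture
    p_keep_binder:   assumed probability that a binding sequence is retained
    p_keep_other:    assumed probability that a non-binding sequence is retained
    Amplification is assumed to preserve the composition of the retained pool.
    """
    if not 0.0 < binder_fraction < 1.0:
        raise ValueError("binder_fraction must lie strictly between 0 and 1")
    if not (0.0 < p_keep_binder <= 1.0 and 0.0 < p_keep_other <= 1.0):
        raise ValueError("retention probabilities must lie in (0, 1]")

    history = [binder_fraction]
    fraction = binder_fraction
    for _ in range(rounds):
        kept_binders = fraction * p_keep_binder
        kept_others = (1.0 - fraction) * p_keep_other
        # Composition of the mixture after separation (and amplification).
        fraction = kept_binders / (kept_binders + kept_others)
        history.append(fraction)
    return history


if __name__ == "__main__":
    # Hypothetical numbers: one binder per million, 100-fold preference per round.
    for round_number, value in enumerate(enrich(1e-6, 0.5, 0.005, 5)):
        print(round_number, f"{value:.6f}")
```

Under these assumed values the ratio of binding to non-binding sequences grows by the same factor in every round, which is why several repetitions are typically needed before the binding nucleic acids dominate the mixture.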

Spiegelmers and their generation or manufacture is based on a similar principle. The 
manufacture of spiegelmers is described in international patent application WO 98/08856. 
Spiegelmers are L-nucleic acids, which means that they are composed of L-nucleotides rather 
than D-nucleotides as aptamers are. Spiegelmers are characterized by the fact that they have a 
very high stability in biological systems and, comparable to aptamers, specifically interact with 
the target molecule against which they are directed. In the process of generating spiegelmers, a 
heterogeneous population of D-nucleic acids is created and this population is contacted with the 
optical antipode of the target molecule, in the present case for example with the D-enantiomer of 
the naturally occurring L-enantiomer of the polypeptides according to the present invention. 
Subsequently, those D-nucleic acids are separated which do not interact with the optical antipode 
of the target molecule. But those D-nucleic acids interacting with the optical antipode of the 
target molecule are separated, optionally determined and/or sequenced and subsequently the 
corresponding L-nucleic acids are synthesized based on the nucleic acid sequence information 
obtained from the D-nucleic acids. These L-nucleic acids, which are identical in terms of 
sequence with the aforementioned D-nucleic acids interacting with the optical antipode of the 
target molecule, will specifically interact with the naturally occurring target molecule rather than 
with the optical antipode thereof. Similar to the method for the generation of aptamers it is also 
possible to repeat the various steps several times and thus to enrich those nucleic acids 
specifically interacting with the optical antipode of the target molecule. 

In a further aspect the present invention relates to functional nucleic acids interacting with any of 
the nucleic acid molecules according to the present invention, and a method for the manufacture 
of such functional nucleic acids whereby the method is characterized by the use of the nucleic 
acid molecules and their respective sequences according to the present invention and the basic 
steps are known to the one skilled in the art. The functional nucleic acids are preferably 
ribozymes, antisense oligonucleotides and siRNA. 

Ribozymes are catalytically active nucleic acids, which preferably consist of RNA which 
basically comprises two moieties. The first moiety shows a catalytic activity whereas the second 
moiety is responsible for the specific interaction with the target nucleic acid, in the present case 
the nucleic acid coding for the polypeptides according to the present invention. Upon interaction 
between the target nucleic acid and the second moiety of the ribozyme, typically by hybridisation 
and Watson-Crick base pairing of essentially complementary stretches of bases on the two 
hybridising strands, the catalytically active moiety may become active which means that it 
catalyses, either intramolecularly or intermolecularly, the target nucleic acid in case the catalytic 
activity of the ribozyme is a phosphodiesterase activity. Subsequently, there may be a further 
degradation of the target nucleic acid which in the end results in the degradation of the target 
nucleic acid as well as the protein derived from the said target nucleic acid. Ribozymes, their use 
and design principles are known to the one skilled in the art and are, for example, described in 
Doherty and Doudna ((2001) Ribozyme structures and mechanisms. Annu. Rev. Biophys. 
Biomol. Struct. 30, 457-475) and Lewin and Hauswirth (Ribozyme Gene Therapy: Applications 
for molecular medicine. 2001 7: 221-8). 

The activity and design of antisense oligonucleotides for the manufacture of a medicament and 
as a diagnostic agent, respectively, is based on a similar mode of action. Basically, antisense 
oligonucleotides hybridise, based on base complementarity, with a target RNA, preferably with an 
mRNA, and thereby activate RNase H. RNase H is activated by both phosphodiester and 
phosphorothioate-coupled DNA. Phosphodiester-coupled DNA, however, is rapidly degraded by 
cellular nucleases, whereas phosphorothioate-coupled DNA is not. These resistant, 
non-naturally occurring DNA derivatives do not inhibit RNase H upon hybridisation with RNA. In 
other words, antisense polynucleotides are only effective as DNA-RNA hybrid complexes. 
Examples for this kind of antisense oligonucleotides are described, among others, in US patents 
US 5,849,902 and US 5,989,912. In other words, based on the nucleic acid sequence of the target 
molecule which in the present case are the nucleic acid molecules for the polypeptides according 
to the present invention, either from the target protein from which a respective nucleic acid 
sequence may in principle be deduced, or by knowing the nucleic acid sequence as such, 
particularly the mRNA, suitable antisense oligonucleotides may be designed based on the 
principle of base complementarity. 
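
The principle of base complementarity referred to above may be illustrated by the following minimal sketch, which derives a DNA antisense sequence as the reverse complement of a chosen window of an mRNA. The function name `antisense_dna` and the short mRNA used in the example are hypothetical and do not correspond to any sequence of the present invention; chemical modifications of the oligonucleotide are not modelled.

```python
# Watson-Crick partner (as a DNA base) for each base of an RNA target.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(mrna, start, length):
    """Return the DNA antisense oligonucleotide (written 5'->3') for a target window.

    mrna:   target mRNA sequence written 5'->3' using A, C, G and U
    start:  0-based index of the first targeted base
    length: number of targeted bases
    """
    target = mrna.upper()[start:start + length]
    if start < 0 or length <= 0 or len(target) != length:
        raise ValueError("target window lies outside the mRNA sequence")
    try:
        # Reverse the window, then complement each base.
        return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(target))
    except KeyError as error:
        raise ValueError(f"unexpected base in mRNA: {error.args[0]}") from None


if __name__ == "__main__":
    # Hypothetical 9-base mRNA stretch used purely as an illustration.
    print(antisense_dna("AUGGCUAAC", 0, 9))  # GTTAGCCAT
```

The result is written 5' to 3', so that it hybridises in antiparallel orientation with the targeted mRNA window.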

Particularly preferred are antisense-oligonucleotides, which have a short stretch of 
phosphorothioate DNA (3 to 9 bases). A minimum of 3 DNA bases is required for activation of 
bacterial RNase H and a minimum of 5 bases is required for mammalian RNase H activation. In 
these chimeric oligonucleotides there is a central region that forms a substrate for RNase H that 
is flanked by hybridising "arms" comprised of modified nucleotides that do not form substrates 
for RNase H. The hybridising arms of the chimeric oligonucleotides may be modified such as by 
2'-O-methyl or 2'-fluoro. Alternative approaches used methylphosphonate or phosphoramidate 
linkages in said arms. Further embodiments of the antisense oligonucleotide useful in the 
practice of the present invention are P-methoxyoligonucleotides, partial 
P-methoxyoligodeoxyribonucleotides or P-methoxyoligonucleotides. 

Of particular relevance and usefulness for the present invention are those antisense 
oligonucleotides as more particularly described in the two above-mentioned US patents. These 
oligonucleotides contain no naturally occurring 5'→3'-linked nucleotides. Rather the 
oligonucleotides have two types of nucleotides: 2'-deoxyphosphorothioate, which activate 
RNase H, and 2'-modified nucleotides, which do not. The linkages between the 2'-modified 
nucleotides can be phosphodiesters, phosphorothioate or P-ethoxyphosphodiester. Activation of 
RNase H is accomplished by a contiguous RNase H-activating region, which contains between 3 
and 5 2'-deoxyphosphorothioate nucleotides to activate bacterial RNase H and between 5 and 10 
2'-deoxyphosphorothioate nucleotides to activate eucaryotic and, particularly, mammalian 
RNase H. Protection from degradation is accomplished by making the 5' and 3' terminal bases 
highly nuclease resistant and, optionally, by placing a 3' terminal blocking group. 
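
The length requirements stated above for the contiguous RNase H-activating region may be checked as in the following minimal sketch. The one-letter layout notation ('D' for a 2'-deoxyphosphorothioate nucleotide, 'M' for a 2'-modified nucleotide) and the function name `activating_region_ok` are assumptions introduced here for illustration; the requirement of exactly one contiguous region is likewise a simplifying assumption of the sketch.

```python
import re

# Length limits of the RNase H-activating region as stated in the text above.
REGION_LIMITS = {"bacterial": (3, 5), "mammalian": (5, 10)}


def activating_region_ok(layout, rnase_h):
    """Check the RNase H-activating region of a chimeric oligonucleotide layout.

    layout:  string of 'D' (2'-deoxyphosphorothioate) and 'M' (2'-modified)
    rnase_h: "bacterial" or "mammalian"
    """
    if not layout or set(layout) - {"D", "M"}:
        raise ValueError("layout may only contain the letters 'D' and 'M'")
    low, high = REGION_LIMITS[rnase_h]
    # Lengths of all contiguous runs of activating nucleotides.
    runs = [len(run) for run in re.findall(r"D+", layout)]
    return len(runs) == 1 and low <= runs[0] <= high


if __name__ == "__main__":
    print(activating_region_ok("MMMMDDDDMMMM", "bacterial"))   # True
    print(activating_region_ok("MMMMDDDDMMMM", "mammalian"))   # False
```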

More particularly, the antisense oligonucleotide comprises a 5' terminus and a 3' terminus; and 
from 11 to 59 5'→3'-linked nucleotides independently selected from the group consisting of 
2'-modified phosphodiester nucleotides and 2'-modified P-alkyloxyphosphotriester nucleotides; 
and wherein the 5'-terminal nucleoside is attached to an RNase H-activating region of between 
three and ten contiguous phosphorothioate-linked deoxyribonucleotides, and wherein the 
3'-terminus of said oligonucleotide is selected from the group consisting of an inverted 
deoxyribonucleotide, a contiguous stretch of one to three phosphorothioate 2'-modified 
ribonucleotides, a biotin group and a P-alkyloxyphosphotriester nucleotide. 

Also an antisense oligonucleotide may be used wherein not the 5' terminal nucleoside is attached 
to an RNase H-activating region but the 3' terminal nucleoside as specified above. Also, the 5' 
terminus is selected from the particular group rather than the 3' terminus of said oligonucleotide. 

The nucleic acids as well as the polypeptides according to the present invention may be used as 
or for the manufacture of vaccines. Preferably such vaccine is for the prevention or treatment of 
diseases caused by, related to or associated with GBS. In so far another aspect of the invention 
relates to a method for inducing an immunological response in an individual, particularly a 
mammal, which comprises inoculating the individual with the polypeptide of the invention, or a 
fragment or variant thereof, adequate to produce antibody to protect said individual from 
infection, particularly bacterial infection and most particularly Streptococcus infections. 

Yet another aspect of the invention relates to a method of inducing immunological response in an 
individual which comprises, through gene therapy or otherwise, delivering a nucleic acid 
functionally encoding the polypeptide, or a fragment or a variant thereof, for expressing the 
polypeptide, or a fragment or a variant thereof in vivo in order to induce an immunological 
response to produce antibodies or a cell mediated T cell response, either cytokine-producing T 
cells or cytotoxic T cells, to protect said individual from disease, whether that disease is already 
established within the individual or not. One way of administering the gene is by accelerating it 
into the desired cells as a coating on particles or otherwise. 

A further aspect of the invention relates to an immunological composition which, when 
introduced into a host capable of having induced within it an immunological response, induces 
an immunological response in such host, wherein the composition comprises recombinant DNA 
which codes for and expresses an antigen of the polypeptide of the present invention. The 
immunological response may be used therapeutically or prophylactically and may take the form 
of antibody immunity or cellular immunity such as that arising from CTL or CD4+ T cells. 

The polypeptide of the invention or a fragment thereof may be fused with a co-protein, which may 
not by itself produce antibodies, but is capable of stabilizing the first protein and producing a 
fused protein, which will have immunogenic and protective properties. This fused recombinant 
protein preferably further comprises an antigenic co-protein, such as Glutathione-S-transferase 
(GST) or beta-galactosidase, relatively large co-proteins which solubilise the protein and 
facilitate production and purification thereof. Moreover, the co-protein may act as an adjuvant in 
the sense of providing a generalized stimulation of the immune system. The co-protein may be 
attached to either the amino or carboxy terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and methods 
comprising the polypeptides or polynucleotides of the invention and immunostimulatory DNA 
sequences, such as those described in Sato, Y. et al., Science 273: 352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or particular 
fragments thereof which have been shown to encode non-variable regions of bacterial cell 
surface proteins in DNA constructs used in such genetic immunization experiments in animal 
models of infection with S. agalactiae. Such fragments will be particularly useful for identifying 
protein epitopes able to provoke a prophylactic or therapeutic immune response. This approach 
can allow for the subsequent preparation of monoclonal antibodies of particular value from the 
requisite organ of the animal successfully resisting or clearing infection for the development of 
prophylactic agents or therapeutic treatments of S. agalactiae infection in mammals, particularly 
humans. 

The polypeptide may be used as an antigen for vaccination of a host to produce specific 
antibodies which protect against invasion of bacteria, for example by blocking adherence of 
bacteria to damaged tissue. Examples of tissue damage include wounds in skin or connective 
tissue caused e.g. by mechanical, chemical or thermal damage or by implantation of indwelling 
devices, or wounds in the mucous membranes, such as the mouth, mammary glands, urethra or 
vagina. 

The present invention also includes a vaccine formulation, which comprises the immunogenic 
recombinant protein together with a suitable carrier. Since the protein may be broken down in 
the stomach, it is preferably administered parenterally, including, for example, administration 
that is subcutaneous, intramuscular, intravenous, or intradermal. Formulations suitable for 
parenteral administration include aqueous and non-aqueous sterile injection solutions which may 
contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic 
with the bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The formulations 
may be presented in unit-dose or multi-dose containers, for example, sealed ampoules and vials, 
and may be stored in a freeze-dried condition requiring only the addition of the sterile liquid 
carrier immediately prior to use. The vaccine formulation may also include adjuvant systems for 
enhancing the immunogenicity of the formulation, such as oil-in-water systems and other 
systems known in the art. The dosage will depend on the specific activity of the vaccine and can 
be readily determined by routine experimentation. 

It is also within the present invention that the vaccine comprises apart from the polypeptide 
and/or nucleic acid molecule according to the present invention other compounds, which are 
biologically or pharmaceutically active. Preferably, the vaccine composition comprises at least 
one polycationic peptide. The polycationic compound(s) to be used according to the present 
invention may be any polycationic compound, which shows the characteristic effects according 
to the WO 97/30721. Preferred polycationic compounds are selected from basic polypeptides, 
organic polycations, basic polyamino acids or mixtures thereof. These polyamino acids should 
have a chain length of at least 4 amino acid residues (WO 97/30721). Especially preferred are 
substances like polylysine, polyarginine and polypeptides containing more than 20 %, especially 
more than 50 % of basic amino acids in a range of more than 8, especially more than 20, amino 
acid residues or mixtures thereof. Other preferred polycations and their pharmaceutical 
compositions are described in WO 97/30721 (e. g. polyethyleneimine) and WO 99/38528. 
Preferably these polypeptides contain between 20 and 500 amino acid residues, especially 
between 30 and 200 residues. 
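
The percentage and length criteria given above for especially preferred polypeptides may be evaluated as in the following minimal sketch. Treating lysine (K), arginine (R) and histidine (H) as the basic amino acids is an assumption of this sketch, as are the function names; the sequences in the example are hypothetical.

```python
# Assumption: K, R and H are counted as basic amino acids (one-letter code).
BASIC_RESIDUES = frozenset("KRH")


def basic_fraction(sequence):
    """Return the fraction of basic residues in a one-letter amino acid sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    sequence = sequence.upper()
    return sum(residue in BASIC_RESIDUES for residue in sequence) / len(sequence)


def is_preferred_polycation(sequence, min_fraction=0.20, min_length=8):
    """True if the sequence has more than min_length residues and more than
    min_fraction basic residues (defaults follow the 20 % / 8 residue criteria)."""
    return len(sequence) > min_length and basic_fraction(sequence) > min_fraction


if __name__ == "__main__":
    print(is_preferred_polycation("KKKKKKKKKK"))             # True
    print(is_preferred_polycation("KAAAAAAAAA"))             # False (10 % basic)
    # Stricter criteria: more than 50 % basic, more than 20 residues.
    print(is_preferred_polycation("KR" * 11, 0.50, 20))      # True
```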

These polycationic compounds may be produced chemically or recombinantly or may be derived 
from natural sources. 

Cationic (poly)peptides may also be anti-microbial with properties as reviewed in Ganz et al., 
1999; Hancock, 1999. These (poly)peptides may be of prokaryotic or animal or plant origin or 
may be produced chemically or recombinantly (WO 02/13857). Peptides may also belong to the 
class of defensins (WO 02/13857). Sequences of such peptides can, for example, be found in the 
Antimicrobial Sequences Database under the following internet address: 

http://www.bbcm.univ.trieste.it/~tossi/pag2.html 

Such host defence peptides or defensins are also a preferred form of the polycationic polymer 
according to the present invention. Generally, a compound allowing as an end product activation 
(or down-regulation) of the adaptive immune system, preferably mediated by APCs (including 
dendritic cells) is used as polycationic polymer. 

Especially preferred for use as polycationic substances in the present invention are cathelicidin 
derived antimicrobial peptides or derivatives thereof (International patent application WO 
02/13857, incorporated herein by reference), especially antimicrobial peptides derived from 
mammal cathelicidin, preferably from human, bovine or mouse. 

Polycationic compounds derived from natural sources include HIV-REV or HIV-TAT (derived 
cationic peptides, antennapedia peptides, chitosan or other derivatives of chitin) or other peptides 
derived from these peptides or proteins by biochemical or recombinant production. Other 
preferred polycationic compounds are cathelin or related or derived substances from cathelin. 
For example, mouse cathelin is a peptide, which has the amino acid sequence 
NH2-RLAGLLRKGGEKIGEKLKKIGQKIKNFFQKLVPQPE-COOH. Related or derived cathelin 
substances contain the whole or parts of the cathelin sequence with at least 15-20 amino acid 
residues. Derivations may include the substitution or modification of the natural amino acids by 
amino acids which are not among the 20 standard amino acids. Moreover, further cationic 
residues may be introduced into such cathelin molecules. These cathelin molecules are preferred 
to be combined with the antigen. These cathelin molecules surprisingly have turned out to be 
also effective as an adjuvant for an antigen without the addition of further adjuvants. It is 
therefore possible to use such cathelin molecules as efficient adjuvants in vaccine formulations 
with or without further immunactivating substances. 

Another preferred polycationic substance to be used according to the present invention is a 
synthetic peptide containing at least 2 KLK-motifs separated by a linker of 3 to 7 hydrophobic 
amino acids (International patent application WO 02/32451, incorporated herein by reference). 
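
The structural feature stated above, namely at least two KLK-motifs separated by a linker of 3 to 7 hydrophobic amino acids, may be tested on a candidate sequence as in the following minimal sketch. The set of residues treated as hydrophobic (A, V, L, I, M, F, W) and the function name `has_klk_pattern` are assumptions of this sketch, and the sequences in the example are hypothetical illustrations.

```python
import re

# Assumption: residues counted as hydrophobic for the linker (one-letter code).
HYDROPHOBIC = "AVLIMFW"

# Two KLK-motifs separated by a linker of 3 to 7 hydrophobic residues.
KLK_PATTERN = re.compile(rf"KLK[{HYDROPHOBIC}]{{3,7}}KLK")


def has_klk_pattern(sequence):
    """True if the sequence contains two KLK-motifs joined by a 3-7 residue
    hydrophobic linker."""
    return KLK_PATTERN.search(sequence.upper()) is not None


if __name__ == "__main__":
    print(has_klk_pattern("KLKLLLLLKLK"))      # True  (linker of 5 residues)
    print(has_klk_pattern("KLKLLKLK"))         # False (linker of 2 residues)
    print(has_klk_pattern("KLKLLLLLLLLKLK"))   # False (linker of 8 residues)
```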

The pharmaceutical composition of the present invention may further comprise 
immunostimulatory nucleic acid(s). Immunostimulatory nucleic acids are e.g. neutral or 
artificial CpG containing nucleic acids, short stretches of nucleic acid derived from 
non-vertebrates or in form of short oligonucleotides (ODNs) containing non-methylated 
cytosine-guanine di-nucleotides (CpG) in a certain base context (e.g. described in WO 96/02555). 
Alternatively, also nucleic acids based on inosine and cytidine as e.g. described in the WO 
01/93903, or deoxynucleic acids containing deoxy-inosine and/or deoxyuridine residues 
(described in WO 01/93905 and PCT/EP 02/05448, incorporated herein by reference) may 
preferably be used as immunostimulatory nucleic acids for the present invention. Preferably, the 
mixtures of different immunostimulatory nucleic acids may be used according to the present 
invention. 

It is also within the present invention that any of the aforementioned polycationic compounds is 
combined with any of the immunostimulatory nucleic acids as aforementioned. Preferably, such 
combinations are according to the ones as described in WO 01/93905, WO 02/32451, WO 
01/54720, WO 01/93903, WO 02/13857 and PCT/EP 02/05448 and the Austrian patent 
application A 1924/2001, incorporated herein by reference. 

In addition or alternatively such vaccine composition may comprise apart from the 
polypeptide/nucleic acid molecules according to the present invention a neuroactive compound. 
Preferably, the neuroactive compound is human growth factor as, e.g. described in WO 
01/24822. Also preferably, the neuroactive compound is combined with any of the polycationic 
compounds and/or immunostimulatory nucleic acids as aforementioned. 

In a further aspect the present invention is related to a pharmaceutical composition. Such 
pharmaceutical composition is, for example, the vaccine described herein. Also a pharmaceutical 
composition is a pharmaceutical composition which comprises any of the following compounds 
or combinations thereof: the nucleic acids according to the present invention, the polypeptides 
according to the present invention, the vector according to the present invention, the cells 
according to the present invention, the antibody according to the present invention, the functional 
nucleic acids according to the present invention and the binding peptides such as the anticalines 
according to the present invention, any agonists and antagonists screened as described herein. In 
connection therewith any of these compounds may be employed in combination with a 
non-sterile or sterile carrier or carriers for use with cells, tissues or organisms, such as a 
pharmaceutical carrier suitable for administration to a subject. Such compositions comprise, for 
instance, a media additive or a therapeutically effective amount of a polypeptide of the invention 
and a pharmaceutically acceptable carrier or excipient. Such carriers may include, but are not 
limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof 
The formulation should suit the mode of administration. 

The pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, 
intramuscular, subcutaneous, intranasal or intradermal routes among others. 

The pharmaceutical compositions generally are administered in an amount effective for 
treatment or prophylaxis of a specific indication or indications. In general, the compositions are 
administered in an amount of active agent of at least about 10 µg/kg body weight. In most cases 
they will be administered in one or more doses in an amount not in excess of about 8 mg/kg body 
weight per day. Preferably, in most cases, the dose is from about 10 µg/kg to about 1 mg/kg body 
weight, daily. For administration particularly to mammals, and particularly humans, it is 
expected that the daily dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg and 
typically around 1 mg/kg. For example, a dose may be 1 mg/kg daily. It will be appreciated that 
the optimum dosage will be determined by standard methods for each treatment modality and 
indication, taking into account the indication, its severity, route of administration, complicating 
conditions and the like. The physician in any event will determine the actual dosage, which will 
be most suitable for an individual and will vary with the age, weight and response of the 
particular individual. The above dosages are exemplary of the average case. There can, of 
course, be individual instances where higher or lower dosage ranges are merited, and such are 
within the scope of this invention. 
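
The per-kilogram ranges stated above translate into absolute daily amounts by simple multiplication with body weight. The following is a minimal sketch of that arithmetic only; the function names and the 70 kg example weight are illustrative assumptions and are not part of the specification, and the actual dosage remains a matter for the physician as stated above.

```python
# Illustrative arithmetic only: converts the per-kg daily dosage levels
# quoted in the text (0.01 mg/kg to 10 mg/kg, typically about 1 mg/kg)
# into absolute daily amounts. Names and the example weight are assumptions.

def daily_dose_range_mg(weight_kg, low_mg_per_kg=0.01, high_mg_per_kg=10.0):
    """Return (low, high) daily amount of active agent in mg."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


def typical_daily_dose_mg(weight_kg, typical_mg_per_kg=1.0):
    """Return the daily amount in mg at the 'typical' level of about 1 mg/kg."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return weight_kg * typical_mg_per_kg


if __name__ == "__main__":
    low, high = daily_dose_range_mg(70)  # hypothetical 70 kg individual
    print(f"range: {low:.2f} mg to {high:.0f} mg per day")
    print(f"typical: {typical_daily_dose_mg(70):.0f} mg per day")
```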

In therapy or as a prophylactic, the active agent may be administered to an individual as an 
injectable composition, for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application, for example in the form 
of ointments, creams, lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated 
dressings and sutures and aerosols, and may contain appropriate conventional additives, 
including, for example, preservatives, solvents to assist drug penetration, and emollients in 
ointments and creams. Such topical formulations may also contain compatible conventional 
carriers, for example cream or ointment bases, and ethanol or oleyl alcohol for lotions. Such 
carriers may constitute from about 1 % to about 98 % by weight of the formulation; more usually 
they will constitute up to about 80 % by weight of the formulation. 

The pharmaceutical composition may be administered in conjunction with an in-dwelling device. 
In-dwelling devices include surgical implants, prosthetic devices and catheters, i. e., devices that 
are introduced to the body of an individual and remain in position for an extended time. Such 
devices include, for example, artificial joints, heart valves, pacemakers, vascular grafts, vascular 
catheters, cerebrospinal fluid shunts, urinary catheters, continuous ambulatory peritoneal dialysis 
(CAPD) catheters, etc. 

The composition of the invention may be administered by injection to achieve a systemic effect 
against relevant bacteria shortly before insertion of an in-dwelling device. Treatment may be 
continued after surgery during the in-body time of the device. In addition, the composition could 
also be used to broaden perioperative cover for any surgical technique to prevent Streptococcus 
infections. 

Many orthopaedic surgeons consider that humans with prosthetic joints should be considered for 
antibiotic prophylaxis before dental treatment that could produce a bacteremia. Late deep 
infection is a serious complication sometimes leading to loss of the prosthetic joint and is 
accompanied by significant morbidity and mortality. It may therefore be possible to extend the 
use of the active agent as a replacement for prophylactic antibiotics in this situation. 

In addition to the therapy described above, the compositions of this invention may be used 
generally as a wound treatment agent to prevent adhesion of bacteria to matrix proteins exposed 
in wound tissue and for prophylactic use in dental treatment as an alternative to, or in 
conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling device 
immediately before insertion. The active agent will preferably be present at a concentration of 1 
µg/ml to 10 mg/ml for bathing of wounds or indwelling devices. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be 
employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 µg/kg of 
antigen, and such dose is preferably administered 1-3 times and with an interval of 1-3 weeks. 

With the indicated dose range, no adverse toxicological effects should be observed with the 
compounds of the invention, which would preclude their administration to suitable individuals. 

The antibodies described above may also be used as diagnostic reagents to detect the presence of 
bacteria containing the polypeptides according to the present invention. 

In a further embodiment the present invention relates to diagnostic and pharmaceutical packs and 
kits comprising one or more containers filled with one or more of the ingredients of the 
aforementioned compositions of the invention. The ingredient(s) can be present in a useful 
amount, dosage, formulation or combination. Associated with such container(s) can be a notice 
in the form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, reflecting approval by the agency of the manufacture, 
use or sale of the product for human administration. 

In connection with the present invention any disease related use as disclosed herein such as, e.g., 
use of the pharmaceutical composition or vaccine, is particularly a disease or diseased condition 
which is caused, linked or associated with Gram-positive bacteria, more particularly bacteria 
selected from the group comprising Streptococci, Staphylococci and Lactococci. More 
preferably, the microorganisms are selected from the group comprising S. agalactiae, S. 
pyogenes, S. pneumoniae and S. mutans. In connection therewith it is to be noted that S. 
agalactiae comprises several strains including those disclosed herein. Also, the disease may be 
particularly a disease occurring in any patient selected from the group comprising people with 
chronic illness such as diabetes mellitus and liver failure, pregnant women, the fetus and the 
newborn. A disease related, caused or associated with the bacterial infection to be prevented 
and/or treated according to the present invention includes in neonates sepsis, pneumonia and 
meningitis, and in adults sepsis and soft tissue infections. Pregnancy-related infections are sepsis, 
amnionitis, urinary tract infection and stillbirth. 

In a still further embodiment the present invention is related to a screening method using any of 
the polypeptides or nucleic acids according to the present invention. Screening methods as such 
are known to the one skilled in the art and can be designed such that an agonist or an antagonist 
is screened. Preferably an antagonist is screened which in the present case inhibits or prevents 
the binding of any polypeptide according to the present invention to an interaction partner. Such 
interaction partner can be a naturally occurring interaction partner or a non-naturally occurring 
interaction partner. Preferably the interaction partner is fibrinogen or a fragment thereof in case 
of FbsA or any host cell in case of PabA, PabB, PabC, and PabD, including epithelial cells, 
preferably human epithelial cells. 

The invention also provides a method of screening compounds to identify those, which enhance 
(agonist) or block (antagonist) the function of polypeptides or polynucleotides of the present 
invention, such as its interaction with a binding molecule. The method of screening may involve 
high-throughput. 

For example, to screen for agonists or antagonists, the interaction partner of the polynucleotide 
and nucleic acid, respectively, according to the present invention, a synthetic reaction mix, a 
cellular compartment, such as a membrane, cell envelope or cell wall, or a preparation of any 
thereof, may be prepared from a cell that expresses a molecule that binds to the polypeptide of 
the present invention. The preparation is incubated with labelled polypeptide in the absence or 
the presence of a candidate molecule, which may be an agonist or antagonist. The ability of the 
candidate molecule to bind the binding molecule is reflected in decreased binding of the labelled 
ligand. Molecules, which bind gratuitously, i. e., without inducing the functional effects of the 
polypeptide, are most likely to be good antagonists. Molecules that bind well and elicit 
functional effects that are the same as or closely related to the polypeptide are good agonists. 

The functional effects of potential agonists and antagonists may be measured, for instance, by 
determining activity of a reporter system following interaction of the candidate molecule with a 
cell or appropriate cell preparation, and comparing the effect with that of the polypeptide of the 
present invention or molecules that elicit the same effects as the polypeptide. Reporter systems 
that may be useful in this regard include but are not limited to colorimetric labelled substrate 
converted into product, a reporter gene that is responsive to changes in the functional activity of 
the polypeptide, and binding assays known in the art. 

Another example of an assay for antagonists is a competitive assay that combines the 
polypeptide of the present invention and a potential antagonist with membrane-bound binding 
molecules, recombinant binding molecules, natural substrates or ligands, or substrate or ligand 
mimetics, under appropriate conditions for a competitive inhibition assay. The polypeptide can 
be labelled such as by radioactivity or a colorimetric compound, such that the number of 
polypeptide molecules bound to a binding molecule or converted to product can be determined 
accurately to assess the effectiveness of the potential antagonist. 
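
The read-out of such a competitive assay is commonly summarised as the percentage by which a candidate reduces binding of the labelled polypeptide relative to a control without candidate. The following is a minimal sketch of that calculation, assuming background-corrected signal counts; the function name, the optional background term and the example numbers are illustrative assumptions and are not taken from the specification.

```python
# Sketch: percent inhibition of labelled-polypeptide binding in a
# competitive assay. Inputs are signal counts (e.g. radioactivity or
# absorbance); the background correction is an assumed convention.

def percent_inhibition(bound_with_candidate, bound_without_candidate,
                       background=0.0):
    """Return the percent reduction in specific binding caused by a candidate."""
    control = bound_without_candidate - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    test = bound_with_candidate - background
    return 100.0 * (1.0 - test / control)


if __name__ == "__main__":
    # Hypothetical counts: 1000 without candidate, 250 with candidate.
    print(percent_inhibition(250, 1000))
```

A value near 0 indicates no competition, while a value approaching 100 indicates that the candidate displaces nearly all specific binding.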

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that 
bind to a polypeptide of the invention and thereby inhibit or extinguish its activity. Potential 
antagonists also may be small organic molecules, a peptide, a polypeptide such as a closely 
related protein or antibody that binds to the same sites on a binding molecule without inducing 
functional activity of the polypeptide of the invention. 

Potential antagonists include a small molecule, which binds to and occupies the binding site of 
the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to small 
organic molecules, peptides or peptide-like molecules. 

Other potential antagonists include antisense molecules (see Okano, J. Neurochem. 56:560 
(1991); OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE 
EXPRESSION; CRC Press, Boca Raton, FL (1988), for a description of these molecules). 

Preferred potential antagonists include derivatives of the polypeptides of the invention. 

As used herein the activity of a polypeptide according to the present invention is its capability to 
bind to any of its interaction partners or the extent of such capability of its binding to its or any 
interaction partner. 

In a particular aspect, the invention provides the use of the polypeptide, polynucleotide or 
inhibitor of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: i) in the prevention of adhesion of S. agalactiae to mammalian 
extracellular matrix proteins on in-dwelling devices or to extracellular matrix proteins in 
wounds; ii) to block protein mediated mammalian cell invasion by, for example, initiating 
phosphorylation of mammalian tyrosine kinases (Rosenshine et al., Infect. Immun. 60:2211 
(1992)); iii) to block bacterial adhesion between mammalian extracellular matrix proteins and 
bacterial proteins which mediate tissue damage; iv) to block the normal progression of 
pathogenesis in infections initiated other than by the implantation of in-dwelling devices or by 
other surgical techniques. 

Each of the DNA coding sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein upon expression can be used as a 
target for the screening of antibacterial drugs. Additionally, the DNA sequences encoding the 
amino terminal regions of the encoded protein or Shine-Dalgarno or other translation facilitating 
sequences of the respective mRNA can be used to construct antisense sequences to control the 
expression of the coding sequence of interest. 

The antagonists and agonists may be employed, for instance, to inhibit diseases arising from 
infection with Streptococcus, especially S. agalactiae, such as sepsis. 

In a still further aspect the present invention is related to an affinity device. Such affinity device 
comprises at least a support material and any of the polypeptides according to the present 
invention, which is attached to the support material. Because of the specificity of the 
polypeptides according to the present invention for their target cells or target molecules or their 
interaction partners, the polypeptides allow a selective removal of their interaction partner(s) 
from any kind of sample applied to the support material provided that the conditions for binding 
are met. The sample may be a biological or medical sample, including but not limited to, 
fermentation broth, cell debris, cell preparation, tissue preparation, organ preparation, blood, 
urine, lymph liquid, liquor and the like. 

The polypeptide may be attached to the matrix in a covalent or non-covalent manner. Suitable 
support material is known to the one skilled in the art and can be selected from the group 
comprising cellulose, silicon, glass, aluminium, paramagnetic beads, starch and dextran. 

The present invention is further illustrated by the following figures, examples and the sequence 
listing from which further features, embodiments, and advantages may be taken. It is to be 
understood that the present examples are given by way of illustration only and not by way of 
limitation of the disclosure. 

In connection with the present invention 

Fig. 1 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA 
protein from the serotype III GBS strain 6313; 

Fig. 2 the result of a Southern blot analysis; 

Fig. 3 the DNA sequence of the fbsA-encoding region and the deduced FbsA protein 
from the serotype Ia GBS strain 706 S2; 

Fig. 4 the DNA sequence of the fbsA-encoding region and the deduced FbsA protein 
from the serotype Ib GBS strain 33H1A; 

Fig. 5 the DNA sequence of the fbsA-encoding region and the deduced FbsA protein 
from the serotype II GBS strain 176 H4A; 

Fig. 6 the DNA sequence of the fbsA-encoding region and the deduced FbsA protein 
from the capsule GBS mutant O90R; 

Fig. 7 the DNA sequence of the fbsA-encoding region and the deduced FbsA protein 
from the serotype V GBS strain SS1169; 

Fig. 8 a schematic comparison of the FbsA proteins from the GBS strains 6313 (serotype 
III), 706 S2 (serotype Ia), 33H1A (serotype Ib), 0176 H4A (serotype II), O90R 
(derived from serotype Ia) and SS1169 (serotype V), respectively; 

Fig. 9 the result of a Western blot analysis of truncated FbsA derivatives to identify the 
fibrinogen binding domain in FbsA; 

Fig. 10 a diagram illustrating the competitive inhibition of fibrinogen binding to GBS 
6313 by the purified fusion proteins FbsA-19, FbsA-9 and Bsp, respectively; 

Fig. 11 the result of a spot membrane analysis of fibrinogen binding by synthetic peptides 
derived from the repeat unit of FbsA; 

Fig. 12 the result of a spot membrane analysis of the fibrinogen binding repeat unit; 

Fig. 13 a diagram illustrating the competitive inhibition of fibrinogen binding to GBS 
6313 by synthetic peptides; 

Fig. 14 a diagram illustrating eukaryotic cell adherence (A) and invasion (B) of GBS 
strains 6313, 706 S2, and O90R and their respective fbsA deletion mutants; 

Fig. 15 the result of a peptide ELISA of FbsA peptides with human sera; 

Fig. 16 the DNA sequence of the pabA/B-encoding region and the deduced PabA (nt 319- 
2964) and PabB (nt 3087-5111) proteins from GBS 6313; 

Fig. 17 the DNA sequence of the pabC/D-encoding region and the deduced PabC (nt 487- 
2394) and PabD (nt 2461-3006) proteins from GBS 6313; 

Fig. 18 a picture from a scanning electron microscopy of A549 cells; 

Fig. 19 a diagram illustrating the adherence of GBS 6313 to and invasion of A549 cells in 
the presence of 100 µg/ml of PabA, PabB, PabC or PabD fusion proteins; 

Fig. 20 a diagram illustrating eukaryotic cell adherence and internalization by GBS 6313 
and its pabA and pabB deletion mutants; 

Fig. 21 the result of a Western blot testing anti-PabA, anti-PabB, and anti-PabD antisera 
for their sensitivity; 

Fig. 22 a Western blot analysis of culture supernatant of different S. agalactiae strains and 
their isogenic fbsA deletion mutants for the presence of fibrinogen binding 
proteins; 

Fig. 23 the binding of different S. agalactiae strains and their fbsA deletion mutants to 
immobilized fibrinogen; 

Fig. 24 the adherence and internalization of different S. agalactiae strains and their 
isogenic fbsA mutants into the lung epithelial cell line A549; 

Fig. 25 the adherence and internalization of the S. agalactiae strains 6313 and 6313 ΔfbsA 
into the fibroblast cell line HEL299; 

Fig. 26 the influence of FbsA protein on the adherence of S. agalactiae to A549 cells; 

Fig. 27 the binding of FbsA-coated latex beads to human A549 cells; 

Fig. 28 the transcriptional organization of the pabC-encoding region of S. agalactiae; 

Fig. 29 the PCR-analysis of GBS strains for the presence of pabC and pabD genes; 

Fig. 30 the comparison of the amino acid sequences of the PabC proteins from different 
S. agalactiae strains; 

Fig. 31 the restriction map of the pabC-encoding region, the Western blot analysis of 
PabC and Gbs0851 fusion proteins for fibrinogen-binding and the identification of 
the FbsA and PabC-binding sites within human fibrinogen; 

Fig. 32 the binding of recombinant PabC fusion proteins to immobilized fibrinogen by 
ELISA, and 

Fig. 33 the adherence and invasion of the lung epithelial cell line A549 by the S. 
agalactiae pabC strains. 

The figures referred to in the specification are described in more detail in the 
following. 

Fig. 1 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the serotype III GBS strain 6313. The putative ribosomal binding site (RBS) is underlined and 
the potential transcriptional terminator is indicated by antiparallel arrows. Within the deduced 
FbsA protein, letters in bold and italic indicate the putative signal peptide sequence and letters in 
bold and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are numbered and 
marked by arrows. 

Fig. 2 shows a Southern blot analysis to determine the presence of the fbsA gene in different 
clinical isolates of GBS. Chromosomal DNA from different GBS strains belonging to serotypes 
Ia, Ib, II, III, IV, and V, respectively, was digested with HindIII and, after size separation and 
blotting onto nylon membrane, hybridised with a digoxigenin-labelled fbsA-specific DNA probe. 

Fig. 3 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the serotype Ia GBS strain 706 S2. The putative ribosomal binding site (RBS) is underlined and 
the potential transcriptional terminator is indicated by antiparallel arrows. Within the deduced 
FbsA protein, letters in bold and italic indicate the putative signal peptide sequence and letters in 
bold and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are numbered and 
marked by arrows. 

Fig. 4 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the serotype Ib GBS strain 33H1A. The putative ribosomal binding site (RBS) is underlined and 
the potential transcriptional terminator is indicated by antiparallel arrows. Within the deduced 
FbsA protein, letters in bold and italic indicate the putative signal peptide sequence and letters in 
bold and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are numbered and 
marked by arrows. 

Fig. 5 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the serotype II GBS strain 176 H4A. The putative ribosomal binding site (RBS) is underlined 
and the potential transcriptional terminator is indicated by antiparallel arrows. Within the 
deduced FbsA protein, letters in bold and italic indicate the putative signal peptide sequence and 
letters in bold and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are 
numbered and marked by arrows. 

Fig. 6 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the capsule GBS mutant O90R. The putative ribosomal binding site (RBS) is underlined and the 
potential transcriptional terminator is indicated by antiparallel arrows. Within the deduced FbsA 
protein, letters in bold and italic indicate the putative signal peptide sequence and letters in bold 
and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are numbered and 
marked by arrows. 

Fig. 7 shows the DNA sequence of the fbsA-encoding region and the deduced FbsA protein from 
the serotype V GBS strain SS1169. The putative ribosomal binding site (RBS) is underlined and 
the potential transcriptional terminator is indicated by antiparallel arrows. Within the deduced 
FbsA protein, letters in bold and italic indicate the putative signal peptide sequence and letters in 
bold and underlined mark the cell wall anchor motif LPKTG. Repeats in FbsA are numbered and 
marked by arrows. 

Fig. 8 shows a schematic comparison of the FbsA proteins from the GBS strains 6313 (serotype 
III), 706 S2 (serotype Ia), 33H1A (serotype Ib), 0176 H4A (serotype II), O90R (derived from 
serotype Ia) and SS1169 (serotype V), respectively. Indicated are the locations of the signal 
peptide (black box), the wall-spanning region (WSR; boxes with vertical bars), the cell wall 
anchor motif (LPKTG), and the membrane-spanning region (MSR; boxes with diagonal bars). 
The number of individual repeats is indicated for each protein. Grey boxes represent a repeat 
with the sequence motif 'GNVLERRQRDAENRSQ', boxes with horizontal bars represent 
repeats with an R14K substitution and dotted boxes show the location of repeats with both an 
A11V and R14K substitution. Repeats that carry an E12D substitution are indicated below the 
FbsA proteins from GBS strains 33H1A and SS1169. Above FbsA from 33H1A, a repeat 
carrying a single A11V substitution is indicated. 

Fig. 9 shows a Western blot analysis of truncated FbsA derivatives to identify the fibrinogen- 
binding domain in FbsA. Hexahistidyl-tagged fusion proteins, representing the mature FbsA 
protein (FbsA-19), the N-terminal repeat-containing region (FbsA-N) or the C-terminal part 
(FbsA-C) of FbsA were separated by SDS-PAGE, blotted onto nitrocellulose and tested for their 
binding to human fibrinogen. The fibrinogen binding activity of the three proteins encoded by 
different constructs is indicated below the schematic FbsA drawing. 

Fig. 10 shows the competitive inhibition of fibrinogen binding to GBS 6313 by the purified 
fusion proteins FbsA-19, FbsA-9 and Bsp, respectively. FbsA-9 differs from FbsA-19 in that it 
contains only 9 repeats in its repeat domain. The binding assay was performed with 125I-labelled 
fibrinogen in the presence of different concentrations of each fusion protein. Each experiment 
was performed at least in triplicate. 

Fig. 11 shows a spot membrane analysis of fibrinogen binding by synthetic peptides derived 
from the repeat unit of FbsA. Fibrinogen binding was tested with peptides carrying the FbsA 
repeat motif 'GNVLERRQRDAENRSQ' (SEQ ID 113) and with peptides containing the 
scrambled sequence 'GLSQNRDVRENQRARE' (SEQ ID 205). Synthetic peptides, which 
differed from the repeat motif in that single amino acids had been replaced by alanine, were 
probed for fibrinogen binding. Beside the spot membrane, the sequence of each synthetic peptide 
is listed. Bold and underlined letters indicate amino acid substitutions within the repeat motif. 
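
The set of single-alanine replacement peptides described for Fig. 11 can be enumerated programmatically. The following is a minimal sketch, assuming that each non-alanine position of the repeat motif is replaced in turn and that variants are named in the usual residue-position-residue style (e.g. R6A); whether the alanine already present at position 11 was handled differently on the actual membrane is not stated here, so it is simply skipped.

```python
# Sketch: enumerate single-alanine substitution variants of the FbsA
# repeat motif. The naming scheme and the skipping of the native alanine
# are assumptions made for illustration.

REPEAT = "GNVLERRQRDAENRSQ"


def alanine_scan(seq):
    """Return a list of (name, peptide) pairs, one per non-alanine position."""
    variants = []
    for i, residue in enumerate(seq):
        if residue == "A":
            continue  # native alanine: no substitution to make
        name = f"{residue}{i + 1}A"  # 1-based position, e.g. "R6A"
        variants.append((name, seq[:i] + "A" + seq[i + 1:]))
    return variants


if __name__ == "__main__":
    for name, peptide in alanine_scan(REPEAT):
        print(name, peptide)
```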

Fig. 12 shows a spot membrane analysis of the fibrinogen binding repeat unit. Synthetic peptides 
were tested for fibrinogen binding, in which each of the amino acids of the fibrinogen-binding 
repeat was replaced by each of the 20 amino acids. The vertical letters, printed in bold, represent 
the FbsA-derived fibrinogen binding sequence 'GNVLERRQRDAENRSQ'. The horizontal 
letters represent those amino acids that were introduced in the synthetic peptides instead of the 
original amino acid in the respective position. 
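
The layout described for Fig. 12, with the 16 positions of the repeat on one axis and the 20 amino acids on the other, corresponds to a 16 x 20 grid of peptides. The following is a minimal sketch of how such a grid could be generated; the alphabetical ordering of the amino acids and the function name are illustrative assumptions, since the order used on the actual membrane is not given here.

```python
# Sketch: full single-position substitution grid for the fibrinogen-binding
# repeat. Rows follow the positions of the repeat, columns follow the 20
# standard amino acids (one-letter codes, alphabetical order assumed).

REPEAT = "GNVLERRQRDAENRSQ"
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def substitution_grid(seq, alphabet=AMINO_ACIDS):
    """Return grid[row][col]: seq with position `row` replaced by alphabet[col]."""
    return [
        [seq[:i] + aa + seq[i + 1:] for aa in alphabet]
        for i in range(len(seq))
    ]


if __name__ == "__main__":
    grid = substitution_grid(REPEAT)
    print(len(grid), "rows x", len(grid[0]), "columns")
    # Each row contains the unmodified repeat once, where the introduced
    # amino acid equals the original residue at that position.
    print(len({peptide for row in grid for peptide in row}), "distinct peptides")
```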

Fig. 13 shows the competitive inhibition of fibrinogen binding to GBS 6313 by synthetic 
peptides. The binding assay was performed with 125I-labelled fibrinogen in the presence of 
different concentrations of the peptides pep_FbsA (SEQ ID 211), carrying an FbsA-derived 
repeat unit, and pep_R6A, possessing an R6A substitution within the repeat unit. Each 
experiment was performed at least in triplicate. 

Fig. 14 shows eukaryotic cell adherence (A) and invasion (B) of GBS strains 6313, 706 S2, and 
O90R and their respective fbsA deletion mutants. The values represent the result of at least four 
independent experiments performed in triplicate. Error bars are indicated. 

Fig. 15 shows a peptide ELISA of FbsA peptides with human sera. The 5 biotinylated peptides 
(wild type <1>: GNVLERRQRDAENRSQ SEQ ID No. 113; alanine mutant peptides: <2> 
GAVLERRQRDAENRSQ SEQ ID No. 207, <3> GNALERRQRDAENRSQ SEQ ID No. 208, 
<4> GNVLEARQRDAENRSQ SEQ ID No. 211, <5> GNVLERAQRDAENRSQ SEQ ID No. 
212; see also Fig. 11) were coated on Streptavidin-coated ELISA plates and analysed using 5 sera 
from patients infected with GBS. The patient sera were applied in a dilution of 1:200 and 
1:1,000. IgG (A) and IgA (B) antibodies were detected with secondary anti-human antibodies 
coupled to Horse Radish Peroxidase and ABTS as substrate. 

Fig. 16 shows the DNA sequence of the pabA/B-encoding region and the deduced PabA (nt 319- 
2964) and PabB (nt 3087-5111) proteins from GBS 6313. Putative ribosomal binding sites (RBS) 
are underlined. Letters in bold and italics indicate the putative signal peptides of the deduced 
PabA and PabB proteins and letters in bold and underlined mark the region with high identity to 
the cell wall anchor motif from Gram positive bacteria. 

Fig. 17 shows the DNA sequence of the pabC/D-encoding region and the deduced PabC (nt 487- 
2394) and PabD (nt 2461-3006) proteins from GBS 6313. Putative ribosomal binding sites 
(RBS) are underlined. Letters in bold and italics indicate the putative signal peptides of the 
deduced PabC and PabD proteins. 

Fig. 18 shows a scanning electron microscopy image of A549 cells incubated for two hours with latex 
beads coated with PabA, PabB, PabC and PabD, respectively. BSA-coated latex beads were used as 
a control. 

Fig. 19 shows the adherence of GBS 6313 to and invasion of A549 cells in the presence of 
100 µg/ml of PabA, PabB, PabC or PabD fusion proteins. The adherence of GBS 6313 to A549 
cells (A) and its internalization into these cells (B) were arbitrarily set to 100% and the results 
obtained in the presence of the different fusion proteins were related to these values. Each 
experiment was performed at least three times in triplicate. 
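
Setting the untreated value to 100% and relating the other results to it amounts to a simple normalisation against the control mean. The following is a minimal sketch of that calculation, assuming replicate bacterial counts per condition; the function name and the example counts are hypothetical and not taken from the experiments described.

```python
# Sketch: express adherence or internalization in the presence of a fusion
# protein relative to the untreated control, which is set to 100%.
# Replicate counts are averaged before normalisation (an assumed convention).

from statistics import mean


def relative_to_control(control_counts, treated_counts):
    """Return the treated mean as a percentage of the control mean."""
    control_mean = mean(control_counts)
    if control_mean <= 0:
        raise ValueError("control mean must be positive")
    return 100.0 * mean(treated_counts) / control_mean


if __name__ == "__main__":
    control = [200, 210, 190]   # hypothetical triplicate, no fusion protein
    treated = [95, 105, 100]    # hypothetical triplicate, with fusion protein
    print(f"{relative_to_control(control, treated):.1f}% of control")
```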

Fig. 20 shows eukaryotic cell adherence and internalization by GBS 6313 and its pabA and pabB 
deletion mutants. The adherence of GBS 6313 to A549 cells (A) and its internalization into these 
cells (B) were arbitrarily set to 100% and the results obtained with the GBS mutants 6313 ΔpabA 
and 6313 ΔpabB were related to these values. Each experiment was performed at least three times 
in triplicate. 

Fig. 21 shows the testing of anti-PabA, anti-PabB, and anti-PabD antisera for their sensitivity in 
detecting their respective antigens. Serial dilutions of the fusion proteins PabA, PabB, and PabD 
were spotted onto nitrocellulose and probed with a 1:1000 dilution of the mouse sera against the 
respective proteins. Bound antibodies were labelled with an anti-mouse-HRP conjugate and 
visualized by chemiluminescence. 

Fig. 22 shows a Western blot analysis of culture supernatant of different S. agalactiae strains and 
their isogenic fbsA deletion mutants for the presence of fibrinogen binding proteins. 15 µg of 
proteins from concentrated culture supernatant of the different S. agalactiae strains and their 
fbsA deletion mutants was size separated by SDS-PAGE, blotted onto nitrocellulose and tested 
for the interaction with human fibrinogen. Bound fibrinogen was detected by incubating the blot 
with rabbit anti-fibrinogen antibodies followed by an incubation with goat anti-rabbit antibodies 
coupled to horseradish peroxidase. For the detection of fibrinogen-antibody complexes, 
chemiluminescence was used. 

Fig. 23 shows the binding of different S. agalactiae strains and their fbsA deletion mutants to 
immobilized fibrinogen. Similar cell numbers of the different strains were incubated with 
fibrinogen, which was immobilized to Terasaki plates. The number of bacteria bound to 
fibrinogen was related to the number of input bacteria into the assay. 

Fig. 24 shows the adherence and internalization of different S. agalactiae strains and their 
isogenic fbsA mutants into the lung epithelial cell line A549. Similar numbers of bacteria were 
used to infect A549 cells and the number of bacteria adherent to (A) and internalized by A549 
cells (B) was related to the number of input bacteria. 

Fig. 25 shows the adherence and internalization of the S. agalactiae strains 6313 and 6313 ΔfbsA 
into the fibroblast cell line HEL299. HEL299 cells were infected with S. agalactiae at an MOI of 
10:1 and the cell adherent and internalized bacteria were related to the number of input bacteria. 
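
Relating adherent or internalized bacteria to the number of input bacteria, as done for Figs. 23 to 25, is a ratio that can be expressed as a percentage, with the input following from the multiplicity of infection (MOI). The following is a minimal sketch under the assumption that bacteria are counted as colony forming units; the function names and the example cell numbers are hypothetical.

```python
# Sketch: input bacteria from an MOI, and recovered bacteria expressed as a
# percentage of that input. CFU-based counting is an assumption made here.

def input_bacteria(host_cells, moi):
    """Return the number of bacteria added for a given MOI (bacteria per cell)."""
    if host_cells <= 0 or moi <= 0:
        raise ValueError("host_cells and moi must be positive")
    return host_cells * moi


def percent_of_input(recovered_cfu, input_cfu):
    """Return recovered bacteria as a percentage of the input bacteria."""
    if input_cfu <= 0:
        raise ValueError("input_cfu must be positive")
    return 100.0 * recovered_cfu / input_cfu


if __name__ == "__main__":
    added = input_bacteria(host_cells=200_000, moi=10)  # MOI of 10:1
    print(percent_of_input(recovered_cfu=100_000, input_cfu=added))
```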

Fig. 26 shows the influence of FbsA protein on the adherence of S. agalactiae to A549 cells. The 
adherence assay was performed in the presence of different amounts of purified FbsA fusion 
protein and the number of cell adherent bacteria was related to the number of input bacteria. 

Fig. 27 shows the binding of FbsA-coated latex beads to human A549 cells. Latex beads were
either coated with BSA (A) or FbsA fusion protein (B-D) and the interaction of the coated beads 
with the lung epithelial cell line A549 was analyzed by scanning electron microscopy. 

Fig. 28 shows the transcriptional organization of the pabC-encoding region in S. agalactiae.
The names on top of the figure indicate the genes to which the primer pairs annealed during PCR
with total RNA (A), RT-PCR with total RNA (B) or PCR with chromosomal DNA (C) from S.
agalactiae 6313.

Fig. 29 shows the PCR-analysis of GBS strains for the presence of pabC and pabD genes. The
following strains were used for the PCR: 1, S. agalactiae 1137 (Ia); 2, S. agalactiae A90/14 (Ib);
3, S. agalactiae 6313 (III); 4, S. agalactiae 4416 S3 (III); 5, S. agalactiae 4357 (V); 6, S.
agalactiae 4327 (V).

Fig. 30 shows the comparison of the amino acid sequences of the PabC proteins from S.
agalactiae 6313, S. agalactiae NEM316, and S. agalactiae 2603 V/R.

Fig. 31 shows (A) the restriction map of the pabC-encoding region in S. agalactiae and (B) the
Western blot analysis of PabC and Gbs0851 fusion proteins for fibrinogen-binding. The fusion
proteins were size-separated by SDS-PAGE, transferred onto a nitrocellulose membrane and 
tested for fibrinogen binding by Western blotting. Bound fibrinogen was detected with rabbit 
anti-fibrinogen antibodies, followed by peroxidase-labelled goat anti-rabbit antibodies, and 
visualized by chemiluminescence. PabC and Gbs0851: full-length fusion proteins; PabC-N: N- 
terminal 388 amino acids of PabC; PabC-C: C-terminal 222 amino acids of PabC. 
(C) Identification of the FbsA and PabC-binding sites within human fibrinogen by Western blot
analysis. Human fibrinogen was size separated by SDS-PAGE and either Coomassie stained (left
lane) or transferred onto nitrocellulose and tested for FbsA- or PabC-binding by Western
blotting. Bound fusion proteins were detected with mouse anti-HisTag antibodies, followed by
peroxidase-conjugated goat anti-mouse IgG Fab fragments and visualized by chemiluminescence.

Fig. 32 shows the binding of recombinant PabC fusion proteins to immobilized fibrinogen in a
capture ELISA assay. Microtiter wells were coated with a fixed amount of human fibrinogen,
followed by the addition of increasing concentrations of the different PabC fusion proteins.
Bound fusion protein was detected with mouse anti-HisTag antibodies and peroxidase-
conjugated goat anti-mouse IgG Fab fragments. Colour development was initiated by the addition
of tetramethyl-benzidine substrate and stopped with H2SO4. The absorbance of the microtiter
wells was read at 450 nm. Values represent the means of three independent experiments, each
performed in triplicate.

Fig. 33 shows the adherence (A) and invasion (B) of the lung epithelial cell line A549 by the S.
agalactiae strains 6313 pAT32, ΔpabC pAT32 and ΔpabC pATpabC, respectively. Bacterial
adherence and invasion were calculated as follows: Adherence=number of adherent bacteria / 
total number of bacteria in the assay x 100. Invasion=number of internalized bacteria / total 
number of bacteria in the assay x 100. Each experiment was performed at least three times in 
triplicate. (C) Eukaryotic cell adherence and (D) invasion of S. agalactiae 6313 in the presence
of different amounts of PabC and Bsp fusion proteins. Bacterial adherence and invasion were
calculated as described above for panels (A) and (B). Each experiment was performed at least three
times in triplicate. 



EXAMPLES 

Example 1: Experimental procedures

It is to be noted that the following materials and methods were used throughout the examples 
described herein if not indicated to the contrary. 

Bacterial strains and culture conditions 

GBS strains 6313 (serotype III) and SSI 169 (serotype V) represent reference strains and have
been described previously (Wibawan and Lammler, 1992). GBS strains 706 S2 (serotype Ia),
33H1A (serotype Ib), and 176 H4A (serotype II) were kindly provided by G. S. Chhatwal (GBF
Braunschweig). GBS strain O90R (ATCC 12386) is a derivative of the serotype Ia strain O90.
All GBS strains belonging to the serological groups Ia, Ib, II, III, and V, respectively, are clinical
isolates and were isolated from infected neonates, while GBS strains from group IV were
isolated from cows with mastitis (Chhatwal et al., 1984). E. coli DH5α (Hanahan, 1985) was
used for cloning purposes and E. coli BL21 (Dubendorff and Studier, 1991) served as host for
the production of FbsA fusion proteins. The alkaline-phosphatase-negative E. coli strain CC118
(Manoil and Beckwith, 1985) served as host for pHRM104 derivatives and for the screening for
signal-peptide encoding sequences from GBS.

GBS was cultivated at 37°C in Todd-Hewitt yeast broth (THY) containing 1% yeast extract. E.
coli was grown at 37°C in Luria broth (LB) and clones carrying cosmid pTEX5236 or plasmid
pET28a or pHRM104 were selected in the presence of chloramphenicol (15 µg/ml), kanamycin
(50 µg/ml) or erythromycin (300 ng/ml). Screening for alkaline phosphatase secreting E. coli
CC118 clones was performed on LB-plates containing 80 µg/ml X-phosphate (Sigma).

Antibodies, enzymes, peptides and human proteins 

Affinity-purified rabbit anti-fibrinogen and peroxidase-labelled anti-rabbit antibodies were
obtained from Dako-Biochemicals. Peroxidase-labelled goat anti-mouse antibodies were
purchased from Dianova. Monoclonal anti-his-tag antibodies were obtained from Roche
Diagnostics. Purified rabbit anti-fibronectin antibodies, trypsin, pronase, vitronectin, laminin,
IgG, fibronectin, and fibrinogen were purchased from Sigma-Aldrich. Fibrinogen (Sigma) was
passed through a gelatin-Sepharose column to remove residual contaminating fibronectin in the
preparation. The purity of the fibrinogen preparation was confirmed by SDS-PAGE and
Coomassie-staining and by Western blotting using anti-fibronectin antibodies. Synthetic peptides 
for spot membrane analysis and for inhibition experiments were synthesized as described 
previously (Frank and Overwin, 1996). 

Plasmids and cosmids used for cloning purposes 

A cosmid gene library from GBS 6313 (Reinscheid et al., 2001) was used for the isolation of the
fbsA gene from GBS. Low-copy cosmid pTEX5236 was also used for subcloning of the fbsA
gene after partial digestion of an fbsA-carrying cosmid with Sau3A. Plasmid pET28a (Novagen)
was used for the synthesis of the hexahistidyl-tagged FbsA, PabA, PabB, PabC, and PabD fusion
proteins, which were constructed as follows: A truncated fbsA gene, devoid of the coding region
of the signal peptide and the membrane spanning domain, was PCR amplified from
chromosomal DNA of GBS 6313 using the primers 1
5'GTCCTGTATCTGCCATGGATAGTGTTGG (SEQ ID No. 223) and 2
5'CCGCGGATCCACATTTTGATCATCACCTG (SEQ ID No. 224). The repeat-encoding
region of fbsA was amplified with the primers 3
5'GTCCTGTATCTGCCATGGATAGTGTTGG (SEQ ID No. 225) and 4
5'CCGCGGATCCCCTATAAGTTGACCTAC (SEQ ID No. 226). Amplification of the non-
repeat region of fbsA was performed with the primers 5
5'TGCTTTGCCATGGTAGGTCAACTTATAGGG (SEQ ID No. 227) and 6
5'CCGCGGATCCACATTTTGATCATCACCTG (SEQ ID No. 228). The NcoI and BamHI
restriction sites used for cloning are underlined. Amplification of the pabA, pabB, pabC and
pabD genes, devoid of the coding region of the signal peptide and, if present, of the membrane
spanning domain, was performed with the primers pabA1
5'GTGCCTTGCCATGGAAAGTACCGTAGCGG (SEQ ID No. 229), pabA2
5'GCGGACAGCTCGAGTTTCCCACCTGTCATCGG (SEQ ID No. 230), pabB1
5'GTGCCTTGCCATGGACGACGTAACAACTGATAC (SEQ ID No. 231), pabB2
5'GCGGACAGCTCGAGTGTACCAATACCACCTG (SEQ ID No. 232), pabC1
5'GTGCCTTGCCATGGGCCGGGATAACTAAAG (SEQ ID No. 233), pabC2
5'GCGGACAGCTCGAGCTCTTTTATACGCCATGAG (SEQ ID No. 234), pabD1
5'CCGCGGATCCGATGATAACTTTGAAATGCC (SEQ ID No. 235) and pabD2
5'TGGCACAAGCTTACATTCTGAGCAGAAAGC (SEQ ID No. 236).



The NcoI, XhoI and the BamHI, HindIII restriction sites used for cloning are underlined. The
PCR products and plasmid pET28a were digested with the indicated restriction enzymes, ligated
and transformed into E. coli BL21. Plasmid pETfbsA-9, carrying fbsA with nine internal repeats,
was constructed by partial digestion of pETfbsA-19 with XbaI, subsequent religation and
transformation into E. coli BL21.
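
The cloning strategy above depends on each primer carrying the recognition site of the enzyme later used for digestion. As an illustrative check only (a minimal sketch, not part of the described procedure), the following Python snippet locates such sites in primers 3 and 4 (SEQ ID No. 225 and 226) as printed above. The recognition sequences are the standard ones for NcoI, BamHI, XhoI and HindIII; the function name `find_sites` is hypothetical.

```python
# Standard recognition sequences of the restriction enzymes named in the text.
SITES = {
    "NcoI": "CCATGG",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "HindIII": "AAGCTT",
}


def find_sites(primer):
    """Return {enzyme: 0-based start index} for every site present in the primer."""
    seq = primer.upper().replace(" ", "")  # tolerate spacing left over from typesetting
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


# Primers 3 and 4 for the repeat-encoding region of fbsA, as given in the text.
primer3 = "GTCCTGTATCTGCCATGGATAGTGTTGG"  # SEQ ID No. 225
primer4 = "CCGCGGATCCCCTATAAGTTGACCTAC"   # SEQ ID No. 226

if __name__ == "__main__":
    print(find_sites(primer3))
    print(find_sites(primer4))
```

The same helper can be applied to any of the other primers to confirm that the expected site survived transcription of the sequence.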

A plasmid library of GBS chromosomal fragments was constructed in plasmid pHRM104
essentially as described elsewhere (Pearce et al., 1993). Briefly, chromosomal DNA from GBS
6313 was fractionated by sonication for 45 sec, the obtained fragments were blunt-ended by
Klenow polymerase, ligated into SmaI digested pHRM104, and the ligation mixture transformed
into E. coli CC118. Transformants were plated onto erythromycin and X-phosphate containing 
agar plates and incubated for three days. 

Southern blot analysis

Chromosomal DNA from GBS was prepared as described elsewhere (Pospiech, 1995).
Digoxigenin-labelled probes of the inserts in plasmid pHRM104 were obtained by PCR with the
primers 7 5'AATATCGCCCTGAGC (SEQ ID No. 237) and 8 5'GGTTTTCCCAGTCACG
(SEQ ID No. 238). The same primers were also used for sequencing the inserts in the pHRM104
derivatives. Digoxigenin-labelled probes of the genes fbsA, pabA/B and pabC/D, respectively,
were obtained by PCR with the primers fbsA1 5'GTCCTGTATCTGCTATGGATAGTGTTGG
(SEQ ID No. 239), fbsA2 5'ACATTTTGATCATCACCTG (SEQ ID No. 240), pabA
5'ACTGCTGAGCTAACAGGTG (SEQ ID No. 241), pabB
5'ACATCACCTGACAATGTCGC (SEQ ID No. 242), pabC 5'GCGATTGTGAATAGAATGAG
(SEQ ID No. 243), and pabD 5'TATACAAAGCCTGAGCTTC (SEQ ID No. 244). To analyze
the distribution of the genes fbsA, pabA/B and pabC/D among different clinical isolates of GBS,
their chromosomal DNA was digested with HindIII, BstEII or NcoI and hybridized to the fbsA-,
pabA/B- or pabC/D-specific probe. Labelling, hybridization, washing and detection in Southern
blots were performed using the Dig-labelling and detection kit (Roche Diagnostics) according to
the instructions of the manufacturer with subsequent detection by chemiluminescence.

PCR-amplification and sequencing of fbsA from different GBS strains

The fbsA gene was amplified from the chromosome of the GBS strains 706 S2, 33H1A, 176
H4A, O90R and SSI 169 by PCR using the primers 9 5'TTACCGTAGCCTGTATCACC (SEQ
ID No. 245) and 10 5'CGACCTACGATAGCAACG (SEQ ID No. 246) and the PCR products
were subsequently sequenced. The nucleotide sequence of the fbsA gene from strain 6313 was
obtained by sequencing the 2.6 kb insert of pTEXfbsA.

Construction of fbsA deletion mutants

The thermosensitive plasmid pG+host6 (Appligene) was used for targeted deletion of the fbsA
gene in the GBS strains 6313, 706 S2, and O90R, respectively. Two fragments flanking the fbsA
gene were amplified by PCR from chromosomal DNA of GBS 6313 using the primer pairs
fbsA_del1 5'CCGCGGATCCGAATATGCTACGATGAG (SEQ ID No. 247) and fbsA_del2
5'CCCATCCACTAAACTTAAACATTCCTGATTTCCAAGTTC (SEQ ID No. 248) as well as
fbsA_del3 5'TGTTTAAGTTTAGTGGATGGGGCTGCGGTTTGAGACGC (SEQ ID No. 249)
and fbsA_del4 5'TGGCACAAGCTTTAGGTGGTGAGGGAGTTG (SEQ ID No. 250).
Complementary DNA sequences in the primers fbsA_del2 and fbsA_del3 are marked in italics
and the BamHI and HindIII restriction sites in the primers fbsA_del1 and fbsA_del4 are
underlined. The fbsA flanking PCR products were mixed in equal amounts with each other and
subjected to crossover PCR by using primers fbsA_del1 and fbsA_del4. The resulting PCR
product consisted of the fbsA flanking regions on a single DNA fragment. The crossover PCR
product and plasmid pG+host6 were digested with BamHI and HindIII, ligated and transformed
into E. coli DH5α. The resulting plasmid, pG+ΔfbsA, was transformed into the GBS strains 6313,
706 S2, and O90R, respectively, and transformants were selected by growth on erythromycin
agar at 30°C. Cells in which pG+ΔfbsA had integrated into the chromosome were selected by
growth of the transformants at 39°C with erythromycin selection as described (Maguin et al.,
1996). Four of such integrants from each strain were serially passaged for three days in liquid
medium at 30°C without erythromycin selection to facilitate the excision of plasmid pG+ΔfbsA,
leaving the desired fbsA deletion in the chromosome. Dilutions of the serially passaged cultures
were plated onto agar and single colonies were tested for erythromycin sensitivity to identify
pG+ΔfbsA excisants. Chromosomal DNA of the parental GBS strains 6313, 706 S2, and O90R,
respectively, and of 10 erythromycin sensitive GBS excisants from each strain was tested by
Southern blot after HindIII digestion using a digoxigenin-labelled fbsA flanking fragment
obtained with the primers fbsA_del3 and fbsA_del4.

Construction of pabA and pabB deletion mutants

Deletion mutants in the genes pabA and pabB, respectively, were constructed in GBS 6313 as
described for the construction of fbsA deletion mutants. The primer pairs used to construct the
pabA deletion mutant were pabA_del1 5'GTTAAAGGTAACCTGCCTG (SEQ ID No. 251),
pabA_del2 5'CCCATCCACTAAACTTAAACAT KC/^CICCTATVGTG^
(SEQ ID No. 252) as well as pabA_del3
5'TGTTTAAGTTTAGTGGATGGGCACTlAGAGATmCCAATCC (SEQ ID No. 253) and
pabA_del4 5'GACATCATAGATCCACC (SEQ ID No. 254). After cross-over PCR the
resulting PCR fragment and vector pG+host6 were digested with HindIII and EcoRI and
subsequently ligated, resulting in plasmid pG+ΔpabA. The primer pairs for deleting pabB were
pabB_del1 5'CCGCGG^rCCGGACjCTACGTTTGAACTTC (SEQ ID No. 255), pabB_del2
5'CCOirCCyiCr.4^C7T.4^C4ATATTACCGCAGCAC (SEQ ID No. 256) as well as
pabB_del3 5'TGTTTAAGTTTAGTGGATGGGACAAGAAGGCCAAGAAGG (SEQ ID No.
257) and pabB_del4 5'CACGCAACGCGTCGACGCACAGCTTTAACTGTAC (SEQ ID No.
258). The BamHI and SalI restriction sites are underlined. The fragment obtained by cross-over
PCR and the vector pG+host6 were digested with BamHI and SalI and ligated, resulting in
plasmid pG+ΔpabB. Plasmids pG+ΔpabA and pG+ΔpabB were subsequently transformed into
GBS 6313. The procedure for the generation of pabA and pabB deletion mutants was identical to
that for the construction of fbsA deletion mutants.

General DNA techniques 

Conventional techniques for DNA manipulation, such as restriction enzyme digests, PCR,
ligation, transformation by electroporation and Southern blotting were performed as described by
Sambrook et al. (1989).

Binding of soluble 125I-labelled fibrinogen to GBS

Purified human fibrinogen was radiolabelled with 125I, using the chloramine T method (Hunter
and Greenwood, 1962). Binding of labelled fibrinogen to GBS was performed essentially as
described by Chhatwal et al. (1983). Briefly, overnight cultures of GBS were pelleted by
centrifugation, washed twice with phosphate-buffered saline supplemented with 0.02% Tween
20 (PBST) and adjusted photometrically to a transmission of 10% at 600 nm. A total of 0.2 ml of
the bacterial suspension was added to 20 µl of 125I-labelled fibrinogen containing 23 ng of
fibrinogen. After incubation for 1 h at room temperature, the streptococci were sedimented by
centrifugation and washed with 1 ml of PBST. The radioactivity of the pellet was finally
measured in a gamma counter (Packard Instruments). The amount of bacterial-bound fibrinogen
was calculated as the percentage of total radiolabelled fibrinogen added to the bacteria. In
inhibition experiments, the binding of 23 ng of radiolabelled fibrinogen to 0.2 ml of GBS
(T=10%) was determined in the presence of various amounts of FbsA fusion proteins, Bsp fusion
protein or synthetic peptides. Each experiment was repeated at least three times in triplicate.
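
The two calculations in this procedure can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the conversion from percent transmission to absorbance uses the standard relation A = -log10(T/100), and the function names and example counts are hypothetical.

```python
import math


def transmission_to_absorbance(percent_transmission):
    """Standard conversion A = -log10(T / 100); a 10% transmission gives A of about 1.0."""
    if not 0 < percent_transmission <= 100:
        raise ValueError("percent transmission must be in the range (0, 100]")
    return -math.log10(percent_transmission / 100.0)


def percent_bound(pellet_counts, total_counts):
    """Bound fibrinogen as a percentage of the total radiolabelled fibrinogen added."""
    if total_counts <= 0:
        raise ValueError("total counts must be positive")
    return 100.0 * pellet_counts / total_counts


if __name__ == "__main__":
    # Hypothetical gamma-counter readings, for illustration only.
    print(transmission_to_absorbance(10))
    print(percent_bound(5000, 10000))
```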

Binding of FITC-labelled GBS to immobilized fibrinogen 

Terasaki plates were coated with human fibrinogen and the binding of FITC-labelled bacteria to
the immobilized fibrinogen was measured as described by Podbielski et al. (1999). In brief,
10 µl of a 100 µg/ml stock solution of human fibronectin, fibrinogen, laminin and
collagen I and IV, respectively, was added to each well and incubated overnight at room
temperature in a moist chamber. Subsequently, the microtiter plates were washed with PBS and
residual buffer was carefully removed. FITC-labelling of GBS was performed with cultures in
the exponential (OD600: 0.5) and in the stationary (OD600: 1.5) growth phase. 12 ml of bacterial
culture were pelleted by centrifugation, washed with 12 ml of PBS and resuspended in 2 ml
FITC-solution (1 mg/ml FITC in 50 mM sodium carbonate buffer, pH 9.2). Following a 20 min
incubation in the dark, the cells were pelleted by centrifugation, washed twice with PBS and
sonicated for 20 sec to disrupt bacterial chains. The bacterial suspension was adjusted to an
OD600 of 1.0 with PBS, vortexed vigorously and kept in the dark until use. 10 µl of FITC-labelled
GBS suspension was added to each Terasaki well coated with different human proteins. After a
60 min incubation at 37°C, unbound bacteria were removed by five washes with PBS and bound
bacteria were fixed with 0.5% glutaraldehyde for 5 min. The plates were finally washed twice
with PBS and the fluorescence of each well was determined in an automated Cyto Fluor II
fluorescence reader (PerSeptive Biosystems) at excitation and detection wavelengths of 485 nm
and 530 nm, respectively. The efficiency of FITC-labelling of the bacteria was determined by
incubating 500 µl of the FITC-labelled bacteria for 60 min at 37°C, three washes of the bacteria
with PBS, re-suspension of the cells in 500 µl of PBS and measuring the fluorescence of 10 µl
aliquots of the suspension in uncoated Terasaki microtiter plates. Each assay was measured in
triplicate and repeated at least four times.

Preparation and purification of fusion proteins

The different FbsA fusion proteins as well as the fusion proteins PabA, PabB, PabC, PabD, and
Bsp (Reinscheid et al., 2002) were synthesized in recombinant E. coli BL21 by the addition of 1
mM IPTG after the culture had reached an optical density of 1.0. The cells were disrupted using
a French Press cell and purification of the fusion protein was performed according to the
instructions of Qiagen using Ni2+ affinity chromatography. Subsequently, the PabA, PabB and
PabC fusion proteins were dialyzed against 20 mM Tris/HCl, pH 8.5 and loaded onto a MonoQ
anion exchange column (Amersham/Pharmacia). A linear gradient from 0 M to 1.0 M NaCl in 20
mM Tris/HCl was used to elute the fusion proteins from the column. For further purification of
PabD, the fusion protein was dialyzed against 20 mM Tris/HCl buffer and loaded onto a MonoS
cation exchange column (Amersham/Pharmacia). A linear gradient from 0 M to 1.0 M NaCl in
20 mM Tris/HCl buffer was used for the elution of PabD. All fusion proteins were finally
dialyzed against PBS and stored at -20°C.

Screening for fibrinogen-binding colonies 

Cosmid-carrying E. coli clones were transferred in duplicate to tetracycline containing LB plates 
and incubated overnight. The next day the colonies of one plate were transferred to nitrocellulose 
for 6 h. The cells on the filter were lysed by chloroform vapour for 20 min and subsequently
incubated overnight in PBS with 1 mg/ml lysozyme and 1 mM PMSF. The membrane was 
blocked overnight with 10% skim milk in phosphate-buffered saline (PBS) and subsequently 
probed for binding of human fibrinogen as described below. 

Western Blot and spot membrane analysis 

In Western blot experiments proteins were separated by SDS-PAGE and electroblotted onto 
nitrocellulose. The membrane was subsequently blocked overnight with 10% skim milk in PBS. 
For spot membrane experiments peptides of 16 amino acids were synthesized and equal amounts 
of the peptides were directly spotted onto cellulose paper as described previously (Frank and 
Overwin, 1996). Blocking was performed in membrane blocking solution (MBS) that consisted 
of 20 ml casein based blocking buffer (Genosys Biotechnologies, Cambridge, England), 80 ml 
Tris-buffered saline (TBS), 0.05% Tween 20, and 5 g sucrose. Probing for fibrinogen-binding
was performed as described below.

Detection of fibrinogen binding by Western blot, spot membrane and colony blot

Membranes that had been blocked overnight were incubated for 1 h with 2 µg/ml of human
fibrinogen. For Western and colony blot experiments, fibrinogen and antibodies were diluted in
PBS while for spot membrane analysis they were diluted in MBS. Following three washes with
PBS, the membrane was incubated with anti-fibrinogen antibodies (1:1000 in PBS or MBS) for 1
h. This incubation was followed by three washes with PBS containing 0.05% Tween 20 (PBST)
and two washes with PBS. Subsequently, the membrane was incubated for 1 h with peroxidase-
labelled anti-rabbit IgG (1:1000 in PBS or MBS). After three washes with PBST and two washes
with PBS, bound fibrinogen was detected by chemiluminescence using the ECL-kit
(Amersham/Pharmacia). In control experiments, no cross-reactivity of the used antibodies with
the immobilized proteins and peptides was detected.

Opsonophagocytosis assay 

Resistance to phagocytosis was measured as described by Podbielski et al. (1996). Briefly, a
growing culture of GBS was adjusted to 10^ colony-forming units per millilitre. 100 µl of the
suspension were added to 300 µl of heparinized human blood and the reaction mixture was
incubated at 37°C with end-over-end rotation for 3 h. Pre- and postincubation aliquots were
serially diluted and plated onto THY agar for overnight culture. For each strain the ratio of 
colony-forming units prior to, and following 3 h incubation with human blood was calculated. 
Each experiment was performed three times in triplicate. 
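
The ratio described above can be sketched in a few lines. The text does not state the direction of the ratio, so the version below assumes it is expressed as post-incubation over pre-incubation counts (a multiplication factor); the function names and the example counts are hypothetical.

```python
from statistics import mean


def multiplication_factor(cfu_before, cfu_after):
    """Assumed definition: CFU after 3 h in blood divided by CFU before incubation."""
    if cfu_before <= 0:
        raise ValueError("pre-incubation CFU must be positive")
    return cfu_after / cfu_before


def mean_factor(replicates):
    """Average the factor over (cfu_before, cfu_after) replicate pairs."""
    return mean(multiplication_factor(before, after) for before, after in replicates)


if __name__ == "__main__":
    # Hypothetical triplicate counts, for illustration only.
    print(mean_factor([(1000, 8000), (1200, 9600), (900, 7200)]))
```

A value above 1 under this definition would indicate net growth in blood, and a value below 1 net killing.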

Epithelial cell adherence and internalization assay 

Adherence of GBS to epithelial cells and internalization into epithelial cells were assayed
essentially as described previously (Caparon et al., 1991; Rubens et al., 1992). Briefly, A549
cells were transferred to 24-well tissue culture plates at approximately 4x10^ cells per well and
cultivated overnight in RPMI (Gibco BRL) tissue culture medium, supplemented with 10% of
fetal calf serum. After replacement of the medium with 1 ml of fresh medium, the cells were
infected with 5x10^ streptococci per well and incubated at 37°C for 2 h. The non-adherent
bacteria were removed by washing three times with PBS. In adherence assays, the epithelial
cells were subsequently detached from the well by the addition of trypsin/EDTA and lysed by
adding 300 µl of distilled water. Adherent bacteria were quantitated by plating serial dilutions of
the lysate onto THY agar plates. For internalization assays the epithelial cells were incubated
after 2 h of infection for another 2 h in tissue culture medium supplemented with penicillin G
(10 U) and streptomycin (0.01 mg) to kill extracellular bacteria. After three washes with PBS,
the epithelial cells were detached by the addition of trypsin/EDTA and lysed in 300 µl of
distilled water. The amount of intracellular bacteria was quantified by plating serial dilutions of
the lysate onto THY agar plates. Each experiment was repeated at least three times in triplicate.
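
The quantification step can be illustrated with a short sketch that follows the formula given in the legend of Fig. 33 (recovered bacteria divided by total bacteria in the assay, times 100). The back-calculation from plate counts is the usual dilution arithmetic; the function names, the plated volume and all example numbers are hypothetical.

```python
def cfu_in_lysate(colonies, dilution_factor, plated_ml, lysate_ml):
    """Back-calculate total CFU in the lysate from one countable plate.

    colonies        -- colonies counted on the plate
    dilution_factor -- e.g. 100 for a 1:100 dilution of the lysate
    plated_ml       -- volume of the dilution spread on the plate
    lysate_ml       -- total lysate volume (treated here as a free parameter)
    """
    if plated_ml <= 0 or lysate_ml <= 0:
        raise ValueError("volumes must be positive")
    return colonies * dilution_factor / plated_ml * lysate_ml


def percent_of_input(recovered_cfu, input_cfu):
    """Adherence or invasion as recovered CFU / input CFU x 100."""
    if input_cfu <= 0:
        raise ValueError("input CFU must be positive")
    return 100.0 * recovered_cfu / input_cfu


if __name__ == "__main__":
    # Hypothetical example: 50 colonies from 0.1 ml of a 1:100 dilution of a 0.3 ml lysate.
    recovered = cfu_in_lysate(50, 100, 0.1, 0.3)
    print(recovered, percent_of_input(recovered, 5_000_000))
```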

In competition studies, 1 ml of fresh tissue culture medium containing 50 µg of purified fusion
protein or 1 ng of fibrinogen was added to the A549 cells and subsequently, the cells were
infected with GBS 6313. 

Interaction of protein-coated latex beads with A549 cells 

Approximately 10^8 latex beads (3 µm diameter, Sigma) were washed three times in PBS and then
coated with 300 µg of fusion protein or BSA in 500 µl PBS overnight at 4°C. Coated beads were
washed once in PBS and then blocked with 200 µl of 10 mg/ml BSA in PBS for 1 h at room
temperature. Beads were washed twice in PBS and once in RPMI + 10% FCS and then
resuspended in 1 ml of RPMI + 10% FCS. 300 µl of beads were added to approximately 4 x 10^
A549 cells in 24-well plates. The cells were incubated for 1 h at 37°C (5% CO2), washed five
times with PBS and fixed in a solution containing 3% glutaraldehyde and 5% formaldehyde in
cacodylate buffer for 45 min on ice. The samples were washed with cacodylate buffer,
dehydrated in a graded series of acetone and subjected to critical point drying with CO2. Samples
were then coated with a 10 nm thick gold film and examined by scanning electron microscopy as
described previously (Reinscheid et al., 2001).

Synthesis of biotinylated peptides 

Peptides were synthesized in small scale (4 mg resin; up to 288 in parallel) using standard Fmoc
chemistry on a Rink amide resin (PepChem, Tübingen, Germany) using a SyroII synthesizer
(Multisyntech, Witten, Germany). After the sequence was assembled, peptides were elongated
with Fmoc-epsilon-aminohexanoic acid (as a linker) and biotin (Sigma, St. Louis, MO; activated
like a normal amino acid). Peptides were cleaved off the resin with 93% TFA, 5% triethylsilane,
and 2% water for one hour. Peptides were dried under vacuum and freeze dried three times from
acetonitrile/water (1:1). The presence of the correct mass was verified by mass spectrometry on a
Reflex III MALDI-TOF (Bruker, Bremen, Germany). The peptides were used without further
purification.

Enzyme linked immune assay (ELISA).

Biotin-labeled peptides were coated onto Streptavidin ELISA plates (EXICON) at 10 µg/ml
concentration according to the manufacturer's instructions. Sera were tested at two dilutions,
200x and 1,000x.

Highly specific Horse Radish Peroxidase (HRP)-conjugated anti-human IgG or anti-human IgA
secondary antibodies (Southern Biotech) were used according to the manufacturers'
recommendations (dilution: 1,000x). Antigen-antibody complexes were quantified by measuring
the conversion of the substrate (ABTS) to colored product based on OD405nm readings in an
automated ELISA reader (TECAN SUNRISE). Following manual coating, peptide plates were
processed and analyzed by the Gemini 160 ELISA robot (TECAN) with a built-in reader
(GENIOS, TECAN).

Example 2: Identification of a novel S. agalactiae adhesin by a signal peptide tagging
screen. 

Results 

GBS strain 6313, belonging to serotype III, was tested in binding experiments for its interaction
with radiolabelled human vitronectin, laminin, fibronectin, fibrinogen, and IgG. Strain 6313
accumulated about 50% of the total fibrinogen on its surface. Of the other proteins tested, none
interacted in significant amounts (> 5%) with GBS 6313. Treatment of the bacteria with either
trypsin or pronase reduced the amount of bound fibrinogen to levels below 5%, indicating a
proteinaceous nature of the fibrinogen-binding structures of GBS 6313.

An Escherichia coli cosmid gene library of GBS 6313 was screened by colony blotting for the 
presence of fibrinogen-binding E. coli clones, resulting in the identification of a clone that
revealed strong interaction with human fibrinogen. Partial digestion of its cosmid with Sau3A
and subcloning of fragments in the range of 2-3 kb in plasmid pTEX5236 resulted in the
isolation of plasmid pTEXfbsA, carrying a 2.6 kb insert that conferred fibrinogen-binding to E.
coli DH5α. The insert of pTEXfbsA was sequenced and the analysis of the obtained sequence
identified one open reading frame of 1329 bp, designated fbsA as it encodes a fibrinogen-binding
protein from S. agalactiae (Fig. 1). The fbsA gene is preceded by a typical ribosomal binding site
(AGGAGA) and followed by a sequence resembling a transcriptional terminator (ΔG = -18
kcal/mol). Analysis of the fbsA-encoding region revealed for the deduced FbsA protein typical
features of a surface-located protein from streptococci (Fig. 1), i.e. a signal peptide sequence of
35 amino acids (Nielsen et al., 1997) at its N-terminus and a cell wall anchor motif (LPKTG)
(Schneewind et al., 1993) at its C-terminus. The fbsA gene encodes a primary translation product
of 442 amino acids (Mr 51319), which is putatively processed posttranslationally to yield a
mature protein of 378 amino acids (Mr 44260). The most striking feature of FbsA is its highly
repetitive nature: FbsA carries 19 complete repeats of 16 amino acids that are almost identical.
14 of the 19 repeats are comprised of the sequence motif 'GNVLERRQRDAENRSQ' while two
repeats (3 and 10) carry an R14K substitution and three repeats (2, 9, and 19) possess both an
A11V and an R14K substitution.
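
The substitution notation used here (original residue, 1-based position within the repeat, replacement residue) can be made concrete with a short sketch. It applies the named substitutions to the consensus motif given above; the function name `apply_substitutions` is hypothetical and the snippet is illustrative only.

```python
import re

# Consensus 16-residue repeat motif of FbsA, as given in the text.
MOTIF = "GNVLERRQRDAENRSQ"


def apply_substitutions(sequence, substitutions):
    """Apply substitutions such as 'R14K' (1-based positions) to a peptide sequence."""
    residues = list(sequence)
    for sub in substitutions:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", sub)
        if match is None:
            raise ValueError(f"unrecognised substitution: {sub}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues) or residues[position - 1] != old:
            raise ValueError(f"{sub} does not match the sequence")
        residues[position - 1] = new
    return "".join(residues)


if __name__ == "__main__":
    print(apply_substitutions(MOTIF, ["R14K"]))          # variant in repeats 3 and 10
    print(apply_substitutions(MOTIF, ["A11V", "R14K"]))  # variant in repeats 2, 9 and 19
```

The check on the original residue guards against off-by-one errors: position 11 of the motif is indeed A and position 14 is indeed R.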

Southern blot experiments with clinical GBS isolates, belonging to the serotypes Ia, Ib, II, III,
IV, and V, were performed to analyze the presence of fbsA in GBS. By Southern blot analysis,
the fbsA gene was detected in 25 of 27 strains (Fig. 2), indicating a wide distribution of fbsA in
different serotypes of GBS. Interestingly, the size of the fbsA gene varied significantly between
the individual strains in the Southern blot analysis. To unravel the molecular basis of this size
variation, the fbsA gene was amplified by PCR from the GBS strains 706 S2 (serotype Ia),
33H1A (serotype Ib), 176 H4A (serotype II), SSI 169 (serotype V), and O90R (a capsule
mutant derived from a serotype Ia strain) and sequenced. Analysis of the obtained sequences
revealed one open reading frame in each PCR product with high identity to fbsA from GBS strain
6313 (Figs. 3-7). Analysis of the deduced FbsA proteins identified in all of them a putative
signal peptide at their N-termini and a putative cell wall anchor at their C-termini. As expected
from the Southern blot experiments, the size of the single proteins is significantly different. The
primary translation product of fbsA is 410 amino acids for strain 706 S2 (Fig. 3), 346 amino
acids for strain 33H1A (Fig. 4), 186 amino acids for strain 176 H4A (Fig. 5), 298 amino acids
for strain O90R (Fig. 6), and 618 amino acids for strain SSI 169 (Fig. 7). As shown in Fig. 8, the
different sizes between the single FbsA proteins are exclusively due to a different number of
repeats within the individual proteins. Fig. 8 also shows that the individual repeats of the
deduced FbsA proteins reveal differences in their amino acid composition. Thus, the fbsA gene
from different GBS strains appears to be highly variable in the number of and flexible in the
composition of single repeat-encoding units.
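
The statement that the length differences are due exclusively to repeat number can be checked arithmetically against the figures quoted above. The sketch below assumes that every repeat is exactly 16 residues and that the non-repeat regions have the same length as in strain 6313 (442 residues, 19 repeats); the resulting repeat numbers are inferences from that assumption, not values stated in the text, and the function name is hypothetical.

```python
REPEAT_LENGTH = 16           # residues per repeat, as stated for FbsA
REFERENCE_LENGTH = 442       # primary translation product of strain 6313
REFERENCE_REPEATS = 19       # complete repeats in strain 6313

# Primary translation product lengths quoted in the text.
STRAIN_LENGTHS = {"706 S2": 410, "33H1A": 346, "176 H4A": 186, "O90R": 298, "SSI 169": 618}


def inferred_repeats(length):
    """Infer the repeat number, assuming only the repeat count differs from strain 6313."""
    difference = length - REFERENCE_LENGTH
    if difference % REPEAT_LENGTH != 0:
        raise ValueError("length difference is not a whole number of repeats")
    return REFERENCE_REPEATS + difference // REPEAT_LENGTH


if __name__ == "__main__":
    for strain, length in STRAIN_LENGTHS.items():
        print(strain, inferred_repeats(length))
```

Each of the five quoted lengths differs from 442 by a multiple of 16, which is consistent with the repeat-number explanation.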

Example 3: FbsA is the fibrinogen receptor of Streptococcus agalactiae.

Results

For functional analysis of FbsA, a truncated FbsA polypeptide (FbsA-19), devoid of a signal
peptide and a membrane-spanning region, was synthesized as a hexa-histidyl fusion protein in E.
coli BL21 and purified by affinity chromatography. In Western blot experiments FbsA-19
revealed binding to human fibrinogen (Fig. 9), confirming FbsA as a fibrinogen receptor from
GBS. To localize the fibrinogen-binding region in the FbsA protein, the N-terminal and the C-
terminal regions of FbsA were synthesized as FbsA-N and FbsA-C fusion proteins and tested for
fibrinogen binding. As shown in Fig. 9, fibrinogen binding was observed for FbsA-N but not for
FbsA-C, indicating that the N-terminal repeats of FbsA mediate fibrinogen binding.

In competitive inhibition experiments with 125I-labelled fibrinogen, different proteins were tested 
for their capability to interfere with the binding of radiolabelled fibrinogen to GBS. As a control, 
the non-fibrinogen-binding surface protein Bsp from GBS (Reinscheid et al., 2002) was tested 
for inhibition of the binding of fibrinogen to GBS. As shown in Fig. 10, the addition of increasing 
concentrations of Bsp had no effect on fibrinogen binding by GBS. However, increasing 
concentrations of purified FbsA-19 substantially inhibited the binding of 125I-labelled fibrinogen 
to GBS 6313 cells. To analyse whether the number of repeats of FbsA has an effect on fibrinogen 
binding, a derivative of FbsA with only 9 repeats (FbsA-9) was tested for its capability to inhibit 
fibrinogen binding by GBS. Interestingly, a significantly higher concentration of FbsA-9 had to be 
used to obtain an inhibition of fibrinogen binding comparable to that obtained with FbsA-19. This 
finding indicates that an increasing number of repeats increases the affinity of FbsA for 
fibrinogen and/or allows a higher amount of fibrinogen to be bound by FbsA. 

To further characterize the interaction of FbsA and fibrinogen on the molecular level, 
FbsA-derived synthetic peptides were tested for their interaction with human fibrinogen. At first, we 
analysed a single repeat unit of FbsA (GNVLERRQRDAENRSQ) for its capability to interact 
with human fibrinogen. In dot blot experiments a strong interaction of this synthetic peptide 
with human fibrinogen was observed, while a randomised peptide containing the identical 
amino acids in a different order showed no binding of fibrinogen (Fig. 11). This 
result shows that a single repeat unit of FbsA is capable of specific binding to human fibrinogen. 
To identify amino acids in the repeat region that are essential for fibrinogen binding, we 
synthesized peptides that contained single alanine replacements at different positions. Testing of 
these peptides for their interaction with fibrinogen (Fig. 11) identified N2, V3, L4, and [...] of 
the repeat sequence to be essential for fibrinogen binding. Furthermore, substitution of G1, R^, 
and R14 by alanine significantly reduced the interaction of the repeat unit with human fibrinogen. 

A comprehensive analysis of fibrinogen binding by the 16 amino acid sequence motif was 
performed to identify putative conservative substitutions within the repeat regions. To this end, 
synthetic peptides derived from the sequence motif 'GNVLERRQRDAENRSQ' were 
synthesized and directly spotted onto a membrane. Each peptide differed from the others by a 
single amino acid substitution. In this way, every amino acid within the repeat was successively 
replaced by each of the twenty proteinogenic amino acids. Testing of the individual spots for 
fibrinogen binding resulted in a complex picture of the interaction between fibrinogen and the 
repeat unit (Fig. 12). Replacement of G1 by any other amino acid reduced the fibrinogen binding 
of the repeat, although binding was not completely abolished. N2S and N2T substitutions did not 
affect fibrinogen binding, although replacement of N2 by any other amino acid significantly 
reduced fibrinogen binding. V3 and L4 could not be replaced by other amino acids without 
significant reduction of binding function. Fibrinogen binding was not affected by E5A, E5M and 
E5Q substitutions, but any other amino acid in this position resulted in a lower binding of 
fibrinogen. Substitutions of R6 predominantly caused a loss of fibrinogen binding, while peptides 
with R6A, R6K and R6W substitutions retained little binding activity. However, replacement of 
R7 by any other amino acid resulted in a loss of fibrinogen binding. Q8 could be substituted by 
many amino acids without an effect on binding, while R9 could only be replaced by K or W 
without affecting binding. D10A, D10E, D10N, and D10Q substitutions had no effect on 
fibrinogen binding, and the same was true for A11F, A11I, A11L, A11V and A11Y changes. 
E12 and N13 could be replaced by a variety of amino acids without affecting binding. In contrast, 
only R14K substitutions retained fibrinogen binding of the peptide. Finally, S15 and Q16 could be 
replaced by many other amino acids without loss of binding function. Derived from the result of 
the spotting membrane experiment, the following fibrinogen binding motif can be postulated: 
G-N/S/T-V-L-A/E/M/Q-R-R-X-K/R/W-A/D/E/N/Q-[...] (SEQ ID No. 222). This consensus motif 
could not be identified in fibrinogen binding proteins from other 
organisms, indicating that it represents a novel type of fibrinogen binding site. 
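
The substitution scan and the postulated motif can be illustrated with a short Python sketch. It is a hedged illustration only: positions 1 to 10 follow the motif as printed above, whereas positions 11 to 16 are an assumption derived from the prose description (A/F/I/L/V/Y at position 11, K/R at position 14, any residue "X" elsewhere). The names ALLOWED, matches_motif and single_substitutions are hypothetical.

```python
import re

WILD_TYPE = "GNVLERRQRDAENRSQ"
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the twenty proteinogenic amino acids

# Allowed residues per position. Positions 1-10 follow the printed motif;
# positions 11-16 are an ASSUMPTION taken from the prose description.
ALLOWED = [
    "G", "NST", "V", "L", "AEMQ", "R", "R", AMINO_ACIDS, "KRW", "ADENQ",
    "AFILVY", AMINO_ACIDS, AMINO_ACIDS, "KR", AMINO_ACIDS, AMINO_ACIDS,
]

MOTIF = re.compile("".join(f"[{residues}]" for residues in ALLOWED))


def matches_motif(peptide):
    """Return True if the 16-residue peptide satisfies the assumed motif."""
    return MOTIF.fullmatch(peptide) is not None


def single_substitutions(seq):
    """Yield (label, peptide) for every single-residue replacement,
    e.g. ('R6A', 'GNVLEARQRDAENRSQ'), mirroring the spot membrane layout."""
    for index, original in enumerate(seq):
        for replacement in AMINO_ACIDS:
            if replacement != original:
                label = f"{original}{index + 1}{replacement}"
                yield label, seq[:index] + replacement + seq[index + 1:]


if __name__ == "__main__":
    library = dict(single_substitutions(WILD_TYPE))
    print(len(library), "single-substitution peptides")
    for label in ("N2S", "R6A", "R14K"):
        print(label, library[label], matches_motif(library[label]))
```

A scan of this kind comprises 16 x 19 = 304 variant peptides in addition to the original sequence. The motif treats binding as all-or-nothing, so substitutions that merely weaken binding (for example R6A) are classed as non-matching.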

Based on the results of the spot membrane analysis, two different synthetic peptides were 
tested for their capability to inhibit fibrinogen binding of GBS. One peptide (pep_FbsA) 
represented the original repeat unit sequence 'GNVLERRQRDAENRSQ' (SEQ ID No. 113) 
while the other peptide (pep_R6A) carried an R6A substitution. In the spot membrane analysis, the 
latter peptide had shown significantly reduced binding to fibrinogen. In competitive 
inhibition experiments, both peptides were tested for inhibition of the binding of radiolabelled 
fibrinogen to GBS (Fig. 13). A concentration of 160 µM of pep_FbsA inhibited fibrinogen 
binding by 80% whereas the same concentration of pep_R6A caused only 20% inhibition of 
fibrinogen binding. These findings demonstrate that the soluble form of the repeat unit of FbsA 
is capable of fibrinogen binding. Furthermore, the difference in the inhibition of fibrinogen 
binding between the two peptides confirms the results of the spot membrane analysis and shows 
that R6 plays an important role in fibrinogen binding. 

To analyse the contribution of FbsA to the fibrinogen binding of GBS, fbsA deletion mutants 
were constructed in the GBS strains 6313, 706 S2, and O90R, respectively. Southern blot 
analysis revealed the successful deletion of fbsA in the respective strains (data not shown), which 
were termed accordingly 6313 ΔfbsA, 706 S2 ΔfbsA, and O90R ΔfbsA. Mutants and parental 
strains were subsequently tested for their binding of soluble and immobilized fibrinogen. While 
GBS strains 6313, 706 S2 and O90R exhibited about 50%, 8%, and 12% binding of 125I-labelled 
soluble fibrinogen, their respective fbsA mutants bound less than 2%. Similarly, in binding 
experiments using FITC-labelled bacteria, about 45%, 15%, and 24% of the total bacteria from 
the GBS strains 6313, 706 S2, and O90R bound to immobilized fibrinogen but less than 2% of 
the respective fbsA mutants interacted with the immobilized fibrinogen. From these results it can 
be concluded that FbsA is the major fibrinogen-binding protein in the GBS strains 6313, 706 S2, 
and O90R, respectively, and that it mediates the binding of the bacteria both to soluble and to 
immobilized fibrinogen. 

Example 4: FbsA contributes to adherence and invasion of epithelial cells and inhibits 
opsonophagocytosis. 

Results 

To analyse the importance of FbsA for protecting GBS from opsonophagocytosis, the GBS 
strains 6313 and 6313 ΔfbsA were tested for survival in a classical bactericidal assay in whole 
human blood. After inoculation of heparinized human blood with 100±30 colony forming units 
(cfu) of either of the two strains, both strains grew. However, after three hours of 
incubation, strain 6313 had grown to 2500±500 cfu/assay while strain 6313 ΔfbsA had grown only to 
800±100 cfu/assay. This finding indicates a role of FbsA in preventing opsonization. 
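
To make the growth difference in the bactericidal assay easier to compare, the mean values quoted above can be converted into fold increases and numbers of doublings. The Python sketch below is illustrative only; it uses the mean cfu values, ignores the reported ± ranges, and its function names are hypothetical.

```python
import math

# Mean values quoted in the text (the reported +/- ranges are ignored here).
INOCULUM_CFU = 100
FINAL_CFU = {"6313": 2500, "6313 ΔfbsA": 800}


def fold_increase(final_cfu, start_cfu):
    """Return how many times the population grew relative to the inoculum."""
    return final_cfu / start_cfu


def doublings(final_cfu, start_cfu):
    """Return the number of population doublings, assuming exponential growth."""
    return math.log2(final_cfu / start_cfu)


if __name__ == "__main__":
    for strain, final in FINAL_CFU.items():
        print(
            f"{strain}: {fold_increase(final, INOCULUM_CFU):.1f}-fold, "
            f"{doublings(final, INOCULUM_CFU):.2f} doublings in three hours"
        )
```

On these mean values the parental strain increased 25-fold and the deletion mutant 8-fold over the three-hour incubation.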

The GBS strains 6313, 706 S2 and O90R, and their respective fbsA deletion mutants were also 
tested for their ability to adhere to and invade the human lung epithelial cell line A549. As 
shown in Fig. 14A, the adhesion of the fbsA deletion mutants to A549 cells was significantly 
impaired compared to their parental strains. Similarly, the ability of the fbsA deletion mutants to 
invade A549 cells was also drastically reduced (Fig. 14B). To analyse this effect in more detail, 
the ability of GBS 6313 to adhere to and to invade A549 cells in the presence of 1 µg/ml of 
externally added fibrinogen was quantified. The addition of fibrinogen resulted in a 90% 
reduction of the adherence of GBS 6313 to and invasion of A549 cells. Taken together, these 
findings indicate that in GBS the binding of FbsA to fibrinogen plays an important role in the 
bacterial adhesion to and invasion of human epithelial cells. 



Example 5: FbsA is highly immunogenic in humans. 
Results 

Five sera from patients were analysed for the presence of antibodies directed against 5 peptides 
(wild type <1>: GNVLERRQRDAENRSQ (SEQ ID No. 113); alanine mutant peptides: <2> 
GAVLERRQRDAENRSQ (SEQ ID No. 207), <3> GNALERRQRDAENRSQ (SEQ ID No. 209), 
<4> GNVLEARQRDAENRSQ (SEQ ID No. 211), <5> GNVLERAQRDAENRSQ (SEQ ID 
No. 212); see Fig. 11). Besides the wild type sequence of the repeat region, 4 peptides with 
alanine substitutions that are devoid of fibrinogen binding activity were chosen. The elimination of 
fibrinogen binding activity of the peptides was sought in order to evaluate whether fibrinogen 
may interfere with the binding of antibodies. All peptides were synthesized with an N-terminal 
biotin tag and used as coating reagents on Streptavidin-coated ELISA plates. 
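
The four mutant peptides correspond to single alanine replacements at positions 2, 3, 6 and 7 of the repeat unit (peptide <2> being read as the N2A variant GAVLERRQRDAENRSQ). As a hedged illustration, the following Python sketch derives them from the wild type sequence; the helper alanine_mutant is hypothetical and is not taken from the original work.

```python
WILD_TYPE = "GNVLERRQRDAENRSQ"


def alanine_mutant(seq, position):
    """Return seq with the residue at the 1-based position replaced by alanine."""
    if not 1 <= position <= len(seq):
        raise ValueError(f"position {position} is outside the peptide")
    return seq[:position - 1] + "A" + seq[position:]


# Peptide number in the ELISA -> substituted position in the repeat unit.
MUTATED_POSITION = {2: 2, 3: 3, 4: 6, 5: 7}

PEPTIDES = {1: WILD_TYPE}
for peptide_number, position in MUTATED_POSITION.items():
    PEPTIDES[peptide_number] = alanine_mutant(WILD_TYPE, position)

if __name__ == "__main__":
    for number, sequence in sorted(PEPTIDES.items()):
        print(f"<{number}> {sequence}")
```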

The ELISA analysis was performed with the Gemini 160 ELISA robot. IgA and IgG antibody 
levels are presented for the indicated sera with all five peptides (Fig. 15). Of the five sera chosen 
for this analysis mainly one showed a very high reactivity with the analysed peptides. Comparing 
the wild type and mutant peptides, the mutant peptides 2, 3 and 4 showed similar reactivities 
with both IgA and IgG antibodies, whereas the wild type peptide and peptide 5 were less well 
recognized by all sera. For the wild type peptide, this is probably explained by the presence of 
fibrinogen in human serum, which may compete with antibody binding to the peptide. The 
mutation in peptide 5 may have changed binding of the antibodies and therefore reduced 
reactivity. Interestingly, the reactivities of the peptides were very high with IgA antibodies and 
less pronounced with IgG, indicating that the antibody response in humans mainly involves the 
production of IgA antibodies, which are especially important for the prevention of colonization. 
These data are a strong indication that the FbsA protein is expressed in vivo during infection and 
that it is surface accessible for human antibodies. 

Example 6: Identification of additional S. agalactiae adhesins by the signal peptide 
tagging screen. 

Results 

For the identification of further adhesins and invasins from GBS, chromosomal DNA from GBS 
6313 was fragmented by sonication, the obtained fragments were filled in by Klenow 
polymerase treatment, subsequently ligated into plasmid pHRM104 and transformed into E. coli 
CC118. After screening on X-phosphate-containing LB plates, four colonies were surrounded by 
a wide blue halo. The plasmids of these clones were isolated and their inserts were sequenced. 
Analysis of the obtained sequences identified four incomplete open reading frames, each starting 
with a signal-peptide-encoding sequence. As the genes represented potential adhesins from 
group B streptococcus, they were named pabA, pabB, pabC, and pabD, respectively. 
Digoxigenin-labelled probes were amplified from the four incomplete genes by PCR. The DNA 
probes were used for screening a GBS 6313 cosmid gene bank in E. coli, resulting in the 
identification of one E. coli clone that hybridised with both the pabA and pabB probes and one E. 
coli clone that hybridised with both the pabC and pabD probes. From these clones 
cosmid DNA was isolated and the complete sequence of the genes pabA-D was determined by 
sequencing. Analysis of the obtained sequence information revealed that the pabA gene is 
located in front of the pabB gene (Fig. 16), while the pabC gene precedes the pabD gene 
(Fig. 17). The genes pabA, pabB, pabC, and pabD encode proteins of 901 aa, 674 aa, 643 aa, and 
182 aa, respectively. By the method of Nielsen et al. (1997), a putative signal peptide of 32 aa, 
29 aa, 26 aa, and 23 aa could be predicted for the proteins PabA, PabB, PabC and PabD, 
respectively (Figs. 16 and 17). In addition, the proteins PabA and PabB carry at their C-terminus 
the sequences IPMTG and IPQTG, respectively, which show high identity to cell wall anchor 
motifs of Gram-positive bacteria. By Southern blot analysis, the genes pabA-D were detected in 
90-95% of 35 tested clinical GBS isolates, indicating a wide distribution of these genes in GBS. 
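
The protein and signal peptide lengths given above determine the size of each protein once the signal peptide is removed, and the C-terminal sequences can be compared with the LPXTG consensus commonly described for cell wall anchor motifs of Gram-positive bacteria. The Python sketch below is illustrative: the relaxed pattern [LI]PxTG, which tolerates isoleucine at the first position, is an assumption made for this example, and any further C-terminal processing of the anchor is ignored.

```python
import re

# (precursor length, predicted signal peptide length) in amino acids, from the text.
PAB_PROTEINS = {
    "PabA": (901, 32),
    "PabB": (674, 29),
    "PabC": (643, 26),
    "PabD": (182, 23),
}

# ASSUMPTION: LPXTG-like consensus relaxed to allow I at position 1.
ANCHOR_LIKE = re.compile(r"[LI]P[A-Z]TG")


def length_without_signal_peptide(precursor_length, signal_length):
    """Return the length remaining after removal of the signal peptide."""
    return precursor_length - signal_length


def is_anchor_like(sequence):
    """Return True if the sequence matches the relaxed LPXTG-like pattern."""
    return ANCHOR_LIKE.fullmatch(sequence) is not None


if __name__ == "__main__":
    for name, (total, signal) in PAB_PROTEINS.items():
        print(f"{name}: {length_without_signal_peptide(total, signal)} aa without signal peptide")
    for motif in ("IPMTG", "IPQTG"):
        print(motif, is_anchor_like(motif))
```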

Example 7: PabA-D contribute to adhesion to and invasion of human epithelial 
cells by GBS. 

Results 

To analyse the importance of the four proteins for the adhesion of GBS to epithelial cells, the 
genes pabA and pabB were cloned devoid of their signal-peptide-encoding sequence and cell wall 
anchor motif in the E. coli expression vector pET28a, placing a hexa-histidyl tag at the 
C-terminus of the PabA and PabB fusion proteins. In parallel, the genes pabC and pabD were 
cloned devoid of their signal-peptide-encoding sequence in pET28a, resulting in the synthesis of 
the C-terminally his-tagged fusion proteins PabC and PabD. After construction of the plasmids 
in E. coli DH5α, the constructs were transformed into E. coli BL21 (DE) and the synthesis of the 
fusion proteins was induced by the addition of IPTG. The different fusion proteins were 
subsequently purified by Ni2+-affinity chromatography. The proteins PabA, PabB, and PabC 
were further purified by cation exchange chromatography and the PabD protein was purified to 
homogeneity by anion exchange chromatography. The purified proteins were coated onto latex 
beads and the beads were allowed to interact with the human lung epithelial cell line A549. As a 
control, bovine serum albumin (BSA) coated beads were also allowed to bind to A549 cells. As 
shown in Fig. 18, BSA-coated beads showed no interaction with lung epithelial cells while 
beads coated with the proteins PabA, PabB, PabC or PabD showed significant binding to A549 
cells. This finding indicates that the proteins PabA, PabB, PabC and PabD mediate bacterial 
binding to host cells. In competition experiments, the adhesion of GBS 6313 to A549 cells and 
the invasion of the bacteria into this cell line were quantified in the absence and in the presence 
of purified PabA, PabB, PabC or PabD fusion protein. As shown in Fig. 19, the addition of 
PabA, PabC and PabD significantly reduced the ability of GBS 6313 to adhere to and to invade 
A549 cells. Surprisingly, the addition of PabB increased the adhesion of GBS 6313 to and the 
invasion of A549 cells. This observation again supports the idea of PabA, PabB, PabC and PabD 
being adhesins of GBS. 

To analyse this effect further, the genes pabA and pabB, respectively, were deleted in the 
chromosome of GBS 6313. The resultant mutants were tested for their adhesion to and invasion 
of epithelial cells. Compared to the parental GBS strain 6313, both mutants showed a reduction 
of about 50% in their adherence to and invasion of A549 cells (Fig. 20). 

Taken together, these data suggest that the proteins PabA, PabB, PabC and PabD, respectively, 
play a role in the adhesion of GBS to and the invasion of epithelial cells. 

To test whether the proteins PabA, PabB and PabD elicit an immune response in mice, purified PabA, 
PabB and PabD fusion proteins were used for the subcutaneous immunization of mice. The mice 
were boosted after three weeks and serum was collected six weeks after the first immunization. 
Serial dilutions of the PabA, PabB, and PabD fusion proteins were blotted onto nitrocellulose 
and probed with the mouse sera against the different proteins. As depicted in Fig. 21, the fusion 
proteins PabA, PabB and PabD were sensitively detected by their respective antisera, indicating 
a high immunogenicity of the three proteins in mice. 

Example 8: Experimental procedures II 

Bacterial strains, epithelial cells and growth conditions. 

The cell lines A549 (ATCC CCL-185) and HEL299 (ATCC CCL-137) were obtained from the 
American Type Culture Collection. A549 is a human lung carcinoma cell line that has many 
characteristics of type I alveolar pneumocytes. HEL299 is a human fibroblast cell line. A549 and 
HEL299 cells were propagated in RPMI or DMEM tissue culture medium (both Gibco BRL), 
supplemented with 10% fetal calf serum. Tissue cultures were incubated in a humid 
atmosphere at 37°C with 5% CO2. 

Construction of fbsA deletion mutants in S. agalactiae. 

The fbsA gene was deleted in the S. agalactiae strains 0176 H4A and SS1169 according to the 
procedure described previously (Schubert et al., 2002). Briefly, the thermosensitive plasmid 
pG+ΔfbsA was transformed into the S. agalactiae strains by electroporation and transformants 
were selected by growth on erythromycin agar at 30°C. Cells in which pG+ΔfbsA had integrated 
into the chromosome were selected by growth of the transformants at 39°C with erythromycin 
selection as described (Maguin et al., 1996). Integrant strains were serially passaged for five 
days in liquid medium at 30°C without erythromycin selection to facilitate the excision of 
plasmid pG+ΔfbsA, leaving the desired fbsA deletion in the chromosome. Dilutions of the serially 
passaged cultures were plated onto agar and single colonies were tested for erythromycin 
sensitivity to identify pG+ΔfbsA excisants. Chromosomal DNA of erythromycin-sensitive S. 
agalactiae excisants was tested by Southern blot after HindIII digestion using a 
digoxigenin-labelled fbsA flanking fragment as described previously (Schubert et al., 2002). 

Preparation of hexahistidyl-tagged fusion proteins. 

The protein FbsA-19 represents the full-length FbsA protein from S. agalactiae 6313 and 
consists of 19 repetitive units of 16 amino acids at its N-terminus, whereas protein FbsA-N 
contains the 19 N-terminal repeats of FbsA-19 but is truncated at its C-terminus (Schubert et al., 
2002). The Bsp protein is a surface protein of S. agalactiae that plays a role in the 
morphogenesis of the bacteria (Reinscheid et al., 2002) and served as a control in the present 
study. The fusion proteins were synthesized in recombinant E. coli BL21 by the addition of 1 
mM IPTG after the culture had reached an optical density of 1.0. The cells were disrupted using 
a French Press cell and purification of the fusion protein was performed according to the 
instructions of Qiagen using Ni2+ affinity chromatography. 

Adherence and invasion assays. 

Adherence of S. agalactiae to A549 and HEL299 cells and internalization into these cells was 
assayed essentially as described in Example 1 for A549 cells. In some experiments, A549 cells 
were preincubated with different amounts of FbsA protein or FbsA-derived peptides in RPMI 
medium for 30 min, followed by three washes with PBS. 

Scanning electron microscopy of FbsA-coated latex beads. 

Approximately 1 x 10^ latex beads (3 µm diameter, Sigma) were washed three times in 25 mM 
2-N-morpholinoethanesulfonic acid (MES), pH 6.8. One half was resuspended in 1.0 ml MES 
buffer containing 500 µg/ml FbsA fusion protein and the remaining half was resuspended in 1.0 
ml MES buffer. The beads were incubated overnight at 4°C with end-over-end rotation. After 
pelleting of the beads by centrifugation, the amount of remaining protein in the supernatant was 
determined with a Bradford protein assay kit (BioRad). The beads were washed once with MES 
buffer and blocked for 1 h with 10 mg/ml BSA in MES buffer at room temperature. The beads 
were washed twice with MES buffer, once with RPMI + 10% FCS, and resuspended in RPMI + 
10% FCS. Confluent A549 cells in 24-well plates were inoculated with 2x10^ beads per well in 
a total volume of 1.0 ml. The bead monolayer mixtures were incubated for 2 h at 37°C in a 5% 
CO2 atmosphere. Cells were washed five times with PBS and fixed with 3% paraformaldehyde 
and 4% glutaraldehyde in 0.1% cacodylate buffer for scanning electron microscopy. Scanning 
electron microscopy was performed with a Zeiss DSM 962 microscope. 

Example 9: The fbsA gene and protein are required in different S. agalactiae strains for 
binding to fibrinogen. 

Results 

In the serotype III S. agalactiae strain 6313 the FbsA protein was shown to be essential for the 
fibrinogen binding of this strain (Example 3). The fbsA gene had been deleted in S. agalactiae 
strains 6313, 706 S2 (serotype Ia) and the capsule mutant O90R (Example 3). To further test the 
importance of FbsA for the fibrinogen binding of S. agalactiae strains from different serotypes, 
the fbsA gene was deleted in the genome of the S. agalactiae strains 0176 H4A (serotype II) 
and SS1169 (serotype V). Southern blot analysis confirmed the successful deletion of fbsA in the 
genome of the above-mentioned strains (data not shown) and the respective 
mutants were named according to their original strain with the suffix ΔfbsA. The importance of 
the fbsA gene for the synthesis of fibrinogen-binding proteins in different S. agalactiae strains was 
subsequently addressed by Western blot analysis. Equal amounts of culture supernatant of the S. 
agalactiae strains 6313, O90R, 706 S2, 0176 H4A, and SS1169 and their respective fbsA 
deletion mutants were separated by SDS-PAGE, blotted onto nitrocellulose and subsequently 
tested for the presence of fibrinogen-binding proteins. As depicted in Fig. 22, the S. agalactiae 
strains 6313 and 706 S2 show significant amounts of fibrinogen-binding proteins in their 
culture supernatants while the S. agalactiae strains O90R, 0176 H4A and SS1169 exhibit only 
small amounts of a fibrinogen-binding protein in their culture supernatants. The size of the 
fibrinogen-binding proteins also differs significantly between the different strains. However, FbsA is 
a highly repetitive protein with different numbers of repetitive units in different S. agalactiae 
strains. The S. agalactiae strains used had been selected for further studies because they showed 
significant differences in the number of repetitive units in their fbsA genes. According to the fbsA 
gene sequences from the different strains, the FbsA proteins were predicted to exhibit molecular 
masses of 51 kDa, 34 kDa, 47 kDa, 20 kDa, and 71 kDa for the S. agalactiae strains 6313, 
O90R, 706 S2, 0176 H4A and SS1169, respectively. The observed sizes of fibrinogen-binding 
proteins in the culture supernatants of these strains correspond well to the predicted size of the 
FbsA protein in the different strains (Fig. 22). In the culture supernatants of the different fbsA 
deletion mutants, no fibrinogen-binding protein could be detected. This indicates that the 
observed fibrinogen-binding proteins in the culture supernatants from the different strains 
represent the FbsA protein and that FbsA is the predominant fibrinogen-binding protein in the 
culture supernatant of all tested strains. 

The different S. agalactiae strains and their fbsA mutants were tested for binding of 125I-labelled 
fibrinogen on their surface. S. agalactiae 6313 showed significant binding of radiolabelled 
fibrinogen. However, the strains O90R and 706 S2 exhibited moderate and the strains 0176 H4A 
and SS1169 weak binding of human fibrinogen. The differences in the fibrinogen binding of the 
different strains did not correlate with the number of fibrinogen-binding repeats in the FbsA 
proteins of these strains. However, in the fbsA deletion mutants, fibrinogen binding was reduced 
to values of 1% to 3%. Similarly, in binding experiments using FITC-labelled bacteria, about 
45%, 18%, 14%, 4% and 7% of the total bacteria of the strains 6313, O90R, 706 S2, 0176 H4A 
and SS1169 bound to immobilized fibrinogen, while less than 2% of the respective fbsA mutants 
bound to immobilized fibrinogen (Fig. 23). These results further show that FbsA is the major 
fibrinogen-binding protein in the analyzed S. agalactiae strains and that it mediates the binding 
of the bacteria both to soluble and to immobilized fibrinogen. 
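
The lack of a clear relationship between FbsA size and fibrinogen binding can be illustrated with a rank correlation. The Python sketch below is a rough illustration only. It assumes that the predicted molecular mass of FbsA (51, 34, 47, 20 and 71 kDa for strains 6313, O90R, 706 S2, 0176 H4A and SS1169, as given in this Example) can serve as a proxy for the number of repeats, and it uses the FITC-based binding percentages quoted above rather than the radiolabel data. With only five strains the result has no statistical weight.

```python
STRAINS = ["6313", "O90R", "706 S2", "0176 H4A", "SS1169"]
PREDICTED_MASS_KDA = [51, 34, 47, 20, 71]  # proxy for repeat number (assumption)
BOUND_PERCENT = [45, 18, 14, 4, 7]         # FITC-labelled bacteria bound


def ranks(values):
    """Return the rank of each value (1 = smallest); assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman rank correlation for tie-free data of equal length."""
    n = len(x)
    squared_diffs = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * squared_diffs / (n * (n ** 2 - 1))


if __name__ == "__main__":
    rho = spearman_rho(PREDICTED_MASS_KDA, BOUND_PERCENT)
    print(f"Spearman rho between predicted FbsA mass and binding: {rho:.2f}")
```

For these values the coefficient is about 0.3, a weak association that is in line with the observation that binding does not track the number of repeats.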

Example 10: The fbsA gene and protein are required for efficient attachment of S. 
agalactiae to and internalization into human cells. 



The S. agalactiae strains 6313, O90R, 706 S2, 0176 H4A and SS1169 and their isogenic fbsA 
mutants were tested for their capability to adhere to and to invade the human lung epithelial cell 
line A549. As shown in Fig. 24, S. agalactiae strain 6313 bound to and invaded A549 cells in 
high numbers whereas the strains O90R, 706 S2 and SS1169 showed moderate adherence to 
and internalization into A549 cells. In contrast, the S. agalactiae strain 0176 H4A adhered to 
and invaded A549 cells in very low numbers. Irrespective of the initial differences in the ability 
of the various strains to adhere to and to invade A549 cells, the deletion of the fbsA gene in the 
different strains reduced the adherence to and the invasion into A549 cells to very low but 
similar values among the different strains. Only in strain 0176 H4A, which already showed little 
internalization into A549 cells, did the deletion of the fbsA gene not reduce the internalization of 
the bacteria into A549 cells. These findings indicate an important role of the fbsA gene for the 
adhesion of S. agalactiae to and the internalization into human epithelial cells. To assess the role 
of the fbsA gene for the binding of S. agalactiae to a different cell line, we analyzed the 
adherence and internalization of S. agalactiae 6313 and its fbsA deletion mutant with the 
human fibroblast cell line HEL299. As shown in Fig. 25, the binding of strain 6313 ΔfbsA to HEL299 cells 
and the internalization of the bacteria into this cell line were reduced by about 90%. These data 
suggest that the fbsA gene is of general importance for the adherence and internalization of S. 
agalactiae into different human cells. 

To assess the role of the FbsA protein in bacterial adherence and internalization, the effect of 
pre-treatment of eukaryotic cells with FbsA-19 fusion protein on the adherence and invasion of 
S. agalactiae 6313 was evaluated. The protein FbsA-19 represents the FbsA protein from strain 
6313 and carries 19 repetitive units. As shown in Fig. 26, pre-treatment of A549 cells with 
increasing amounts of FbsA-19 protein substantially inhibited the adherence to and invasion of this 
cell line by S. agalactiae 6313. Of note, we also found a correlation between the reduction in 
bacterial adherence and the invasion in HEL299 cells. 



Results 



The FbsA protein was previously shown to bind to fibrinogen (Example 3). We therefore tested 
the effect of a pre-incubation of S. agalactiae 6313 with fibrinogen on the bacterial adherence 
and invasion of A549 cells. We observed a dose-dependent inhibition of the bacterial adherence 
and invasion of A549 cells by pre-incubating S. agalactiae 6313 with 0.1 µg/ml to 1.0 µg/ml of 
fibrinogen (data not shown). However, the microscopic inspection of the bacteria revealed 
clumping of the bacteria with increasing amounts of fibrinogen. The observed inhibition of 
bacterial adherence and invasion by fibrinogen may therefore be attributed to either the blocking 
of the FbsA protein on the surface of the bacteria or the clumping of the bacteria due to several 
fibrinogen binding sites in the FbsA protein. We also tested the influence of fibronectin on the 
adherence and invasion of S. agalactiae 6313; however, even 10 µg/ml fibronectin did not exert 
an inhibitory effect on bacterial adherence and internalization (data not shown). 

Example 11: FbsA-coated latex beads adhere to A549 cells. 
Results 

The previous experiments already indicated a role of FbsA in the interaction between S. 
agalactiae and the host cell. To investigate whether the interaction of FbsA with eukaryotic cells 
requires additional factors, latex beads were coated with FbsA-19 protein and tested for their 
interaction with human A549 cells. As a control, BSA-coated latex beads were also analyzed for 
their interaction with A549 cells. By scanning electron microscopy only a few BSA-coated latex 
beads were found to bind to A549 cells, while the FbsA-19 coated beads bound to A549 cells in 
high numbers (Fig. 27). Attachment of the FbsA-19 coated beads to the plasma membrane was 
characterized by contact with microvilli and structures that resembled early pseudopod formation 
(Fig. 27C). In some cases, the pseudopod appeared to surround the surface of the bead, 
indicating that the bead was finally internalized (Fig. 27D). However, the internalization of 
FbsA-19 coated beads was observed rather rarely, indicating that FbsA-19 does not usually 
trigger the uptake of S. agalactiae or FbsA-19 coated beads into eukaryotic cells. 

Example 12: The genes pabC and pabD are co-transcribed and conserved in clinical 
strains of S. agalactiae 

Results 

The genomic organisation of the region encompassing the pabC gene is shown in Fig. 31. Using 
RT-PCR and oligonucleotides suitable to amplify overlapping regions of the four respective 
genes, it was shown that the pabC and the gbs0851 (pabD) genes are transcribed as a single 
transcript, whereas RNA polymerase produces independent transcripts for the metK and gbs0853 
genes (Fig. 28). This result indicates that the pabC and the gbs0851 gene products may have a 
function required for the same or a similar process in S. agalactiae. 

In order to determine whether the genes encoding PabC and gbs0851 are conserved in the 
various serotypes and clinical isolates of GBS, chromosomal DNA of 33 different S. agalactiae 
strains was isolated and subjected to PCR analysis with specific primers amplifying the entire 
gene. The two genes pabC and gbs0851 were shown to be present in all tested strains. Fig. 29 
shows as an example the PCR results for the most prevalent serotypes of GBS, Ia, Ib, III and V. 
The gbs0851 gene was amplified from all strains with an identical length, indicative of the 
conservation of the sequence as well as the size of the gene. The PCR of the pabC gene 
surprisingly resulted in the amplification of two differently sized products depending on the strain used 
for analysis, with size differences also observed in strains of the same serotype. The comparison 
of the amino acid sequence of the PabC protein from S. agalactiae 6313 (serotype III), S. 
agalactiae NEM316 (serotype III) and S. agalactiae 2003 V_R (serotype V) is shown in Figure 
30. It shows that the PabC proteins from S. agalactiae 6313 and S. agalactiae NEM316 are 
identical, but clear differences are obvious in PabC from S. agalactiae 2003 V_R. The 
divergence in sequence of PabC can entirely be attributed to the N-terminal part of the protein, 
whereas the C-terminal part is almost identical in all three strains. The observed difference in 
size is also in agreement with the PCR results in Fig. 29. Further PCR experiments confirmed 
that the differences in size stem from sequence variations in the 5' part of the gene rather than 
the 3' terminal part (data not shown). 

Example 13: PabC from S. agalactiae binds human fibrinogen and is involved in invasion 
of eukaryotic cells. 

Initial experiments showed that PabC binds to the α-subunit of fibrinogen (Fig. 31A, C). In order 
to delineate which region of PabC is responsible for fibrinogen binding, the entire protein as well 
as the N-terminal and the C-terminal parts of PabC were expressed as His-tagged fusion proteins. 
After addition of fibrinogen, binding was detected with antibodies directed against fibrinogen 
(Fig. 31B). This experiment showed that the conserved C-terminal part of PabC is in itself 
devoid of fibrinogen-binding activity, while the N-terminal part is sufficient to provide this 
activity to a similar extent as the full-length protein. 

To confirm the Western blot results, a capture ELISA was performed with the same 
purified PabC protein derivatives (Fig. 32). For this purpose, 2 µg of fibrinogen was coated per 
well overnight at 4 °C. The binding activities of increasing concentrations of PabC derivatives 
were quantified via a His-tag antibody-based peroxidase assay. The capture ELISA experiments 
confirmed that the N-terminal part of the PabC protein harbours the fibrinogen-binding 
region. 

It is shown in Fig. 19 that relatively high concentrations of PabC can inhibit the adherence of 
S. agalactiae to, and its invasion into, eukaryotic A549 cells. With lower concentrations of 
recombinant PabC protein, it becomes evident that PabC most likely facilitates invasion 
rather than adherence of GBS (Fig. 33C, D). This was confirmed with the pabC 
deletion mutant, which also showed reduced invasion into A549 cells (Fig. 33A, B). These 
results suggest that PabC may serve S. agalactiae as an invasin to colonize eukaryotic cells. 



The following is a list of all of the publications and documents referred to herein. It is to be 
understood that the whole disclosure of these references is hereby incorporated herein by 
reference. 
The features of the present invention disclosed in the specification, the claims and/or the 
drawings may both separately and in any combination thereof be material for realizing the 
invention in various forms thereof. 
Claims 

1. An isolated nucleic acid molecule, preferably encoding a fibrinogen-binding-polypeptide 
or a fragment thereof, comprising a nucleic acid sequence which is selected from the 
group comprising 

a) a nucleic acid having at least 70% identity to a nucleic acid sequence which is 
selected from the group comprising SEQ ID NO 1 to SEQ ID NO 6, 

b) a nucleic acid which is essentially complementary to the nucleic acid of a), 

c) a nucleic acid comprising at least 15 sequential bases of the nucleic acid of a) or 
b), 

d) a nucleic acid which anneals under stringent hybridisation conditions to the 
polynucleotide of a), b) or c) and 

e) a nucleic acid which, but for the degeneracy of the genetic code, would hybridize 
to the nucleic acid defined in a), b), c) or d). 

2. An isolated nucleic acid molecule, preferably encoding an adhesion factor or a fragment 
thereof, comprising a nucleic acid sequence which is selected from the group comprising 

a) a nucleic acid having at least 70% identity to a nucleic acid sequence set forth in 
SEQ ID NO 7, SEQ ID NO 8, SEQ ID NO 9 or SEQ ID NO 10, 

b) a nucleic acid which is essentially complementary to the nucleic acid of a), 

c) a nucleic acid comprising at least 15 sequential bases of the nucleic acid of a) or 
b), 

d) a nucleic acid which anneals under stringent hybridisation conditions to the 
nucleic acid of a), b) or c) and 

e) a nucleic acid which, but for the degeneracy of the genetic code, would hybridize 
to the nucleic acid defined in a), b), c) or d). 

3. The isolated nucleic acid molecule according to claim 1 or 2, whereby the identity is at 
least 80 %, preferably at least 90 %, more preferably 100 %. 

4. The isolated nucleic acid molecule according to claim 1 or 3, whereby the nucleic acid 
molecule encodes a fibrinogen-binding-protein comprising at least one repeat of an 
amino acid motive comprising 16 amino acids. 

5. The isolated nucleic acid molecule according to claim 4, whereby the encoded 
fibrinogen-binding protein comprises 19 repeats of the amino acid motive whereby the 
amino acid motive is the one specified in any of claims 7 and 15. 

6. The isolated nucleic acid molecule according to claims 2 or 3, whereby the nucleic acid 
molecule encodes an adhesion factor which interacts with epithelial cells, preferably 
human epithelial cells. 

7. An isolated nucleic acid molecule encoding for a polypeptide whereby the polypeptide 
comprises an amino acid motive, whereby the amino acid motive is G-N/S/T-V-L- 
A/E/M/Q-R-R-X-K/R/W-A/D/E/N/Q-A/F/I/^ (SEQ ID NO 222). 

8. The nucleic acid according to any of claims 1 to 7, wherein the nucleic acid is DNA, 
RNA or mixtures thereof, preferably the nucleic acid molecule is isolated from a genomic 
DNA. 

9. A vector comprising a nucleic acid molecule according to any of claims 1 to 8. 
10. The vector according to claim 8, wherein the vector is adapted for recombinant 
expression of the polypeptide encoded by any of the nucleic acid molecules according to 
any of claims 1 to 8. 

11. A cell, preferably a host cell, comprising the vector according to claim 9 or 10. 

12. A polypeptide, preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, 
comprising an amino acid sequence, whereby the amino acid sequence is encoded by a 
nucleic acid molecule according to any one of claims 1 to 8, and fragments of said 
polypeptide. 

13. A polypeptide, preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, 
comprising an amino acid sequence, whereby the amino acid sequence is selected from 
the group comprising SEQ ID NO 11 to 20. 



14. A polypeptide, preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, 
comprising an amino acid sequence, whereby the amino acid sequence is selected from 
the group comprising SEQ ID NO 113 to 205. 

15. A polypeptide, preferably a fibrinogen-binding-polypeptide and/or an adhesion factor, 
comprising an amino acid motive, whereby the polypeptide comprises an amino acid 
motive, whereby the amino acid motive is G-N/S/T-V-L-A/E/M/Q-R-R-X-K/R/W- 
A/D/E/N/Q-A/F/I/iyVA^-X-X-K/R-X-X (SEQ ID NO 222). 

16. A process for producing a polypeptide according to any of claims 12 to 15 or a fragment 
thereof, comprising expressing the nucleic acid molecule according to any of claims 1 to 
8. 



17. A process for producing a cell which expresses a polypeptide according to any of claims 
12 to 15 or a fragment thereof, comprising transforming or transfecting a suitable host 
cell with the vector according to claim 9 or 10. 
18. A pharmaceutical composition, especially a vaccine, comprising a polypeptide or a 
fragment thereof, as defined in any one of claims 12 to 15 or a nucleic acid molecule 
according to any of claims 1 to 8. 

19. The pharmaceutical composition according to claim 18, characterized in that it comprises 
an immunostimulatory substance, whereby the immunostimulatory substance is 
preferably selected from the group comprising polycationic polymers, 
immunostimulatory deoxynucleotides (ODNs), synthetic KLK peptides, neuroactive 
compounds, alum, Freund's complete or incomplete adjuvants or combinations thereof. 

20. Use of a polypeptide according to any one of the claims 12 to 15 or a fragment thereof 
for the manufacture of a medicament, especially for the manufacture of a vaccine against 
bacterial infection. 

21. An antibody, or at least an effective part thereof, which binds at least to a selective part of 
the polypeptide or a fragment thereof according to claims 12 to 15. 

22. The antibody according to claim 21, wherein the antibody is selected from the group 
comprising monoclonal antibodies, polyclonal antibodies, chimeric antibodies, 
humanized antibodies and fragments of each thereof. 

23. Use of a polypeptide according to any of the claims 12 to 15 or a fragment thereof, for the 
manufacture of an antibody. 

24. Use of the antibody according to claim 21 or 22 for the preparation of a medicament for 
treating or preventing bacterial infections, especially Streptococcus agalactiae infections. 

25. A method for identifying an antagonist capable of reducing or inhibiting the activity of 
the polypeptide or fragment thereof according to any of the claims 12 to 15 or which is 
capable of binding to the polypeptide according to any of claims 12 to 15 comprising: 

a) contacting an isolated or immobilized polypeptide according to any of the claims 
12 - 15 or a fragment thereof with a candidate antagonist under conditions to 
permit binding of said candidate antagonist to said polypeptide or fragment 
thereof, in the presence of a component capable of providing a detectable signal in 
response to the binding of the candidate antagonist to said polypeptide or 
fragment thereof; and 

b) detecting the presence or absence of a signal generated in response to the binding 
of the antagonist to the polypeptide or fragment thereof, preferably the presence of 
a signal indicating a compound capable of inhibiting or reducing the activity of 
the polypeptide or fragment thereof. 

26. A method for identifying an antagonist capable of reducing or inhibiting the activity of a 
polypeptide or a fragment thereof according to any of claims 12 to 15 comprising: 

a) providing the polypeptide according to any of the claims 12 to 15 or a fragment 
thereof, 

b) providing an interaction partner of the polypeptide according to any of the claims 
12 to 15, preferably an antibody according to claim 21 or 22, 

c) providing a candidate antagonist, 

d) reacting the polypeptide, the interaction partner of the polypeptide and the 
candidate antagonist, and 

e) determining whether the candidate antagonist inhibits or reduces the activity of 
the polypeptide. 

27. A method for identifying an antagonist capable of reducing or inhibiting the interaction 
activity of the polypeptide according to any of claims 12 to 15 or a fragment thereof to its 
interaction partner comprising: 

a) providing the polypeptide according to any of claims 12 to 15 or a fragment 
thereof, 

b) providing an interaction partner to said polypeptide or a fragment thereof, 
preferably an antibody according to claim 21 or 22, 

c) allowing interaction of said polypeptide or fragment thereof to said interaction 
partner to form an interaction complex, 

d) providing a candidate antagonist, 

e) allowing a competition reaction to occur between the candidate antagonist and the 
interaction complex, and 

f) determining whether the candidate antagonist inhibits or reduces the interaction 
activities of the polypeptide or the fragment thereof with the interaction partner. 

28. An antagonist identified or identifiable by a method according to claim 26 or 27. 

29. A process for in vitro diagnosis of a bacterial infection, preferably Streptococcus 
agalactiae infection, comprising the step of determining the presence of a nucleic acid 
molecule according to any of the preceding claims, or of a polypeptide according to any 
of the preceding claims. 

30. A process for in vitro diagnosing a disease related to expression of the polypeptide 
according to any of claims 12 to 15 or a fragment thereof, comprising determining the 
presence of a nucleic acid sequence encoding said polypeptide or a fragment thereof 
according to any of claims 1 to 8, or the presence of the polypeptide according to any of 
claims 12 to 15 or a fragment thereof. 

31. An affinity device comprising a support material and immobilized to said support 
material a polypeptide according to any of the preceding claims or a nucleic acid 
molecule according to any of the preceding claims. 
32. Use of a polypeptide according to any of the preceding claims or a fragment thereof for 
the isolation and/or purification and/or identification of an interaction partner of said 
polypeptide or a fragment thereof. 

33. Use of any of the polypeptides according to any of the preceding claims for the 
generation of a peptide binding to said polypeptide. 

34. The use according to claim 33, whereby the peptide is selected from the group 
comprising anticalines. 

35. Use of a polypeptide according to any of the preceding claims for the manufacture or 
generation of a functional nucleic acid, whereby the functional nucleic acid is selected 
from the group comprising aptamers and spiegelmers. 

36. Use of a polypeptide according to any of the preceding claims as an antigen. 

37. Use of a nucleic acid according to any of claims 1 to 8, for the manufacture or generation 
of a functional ribonucleic acid, wherein the functional ribonucleic acid is selected from 
the group comprising ribozymes, antisense nucleic acids and siRNA. 
1 

GATCATTAAATAAATCAAGGTTAGTTAGCTTGAAAGATATAAATATATTCCAAAATTCCA 
61 

T^AAAGTAATTGGCATAGTGACAAAAACTATTGCTCCCCTGCTTTAGAAATAATTTATTTT 
121 

TAATTTAATATTAAAAGTAAACTGAAGAATCTAGTTATATTTAAAAAGTAAAGGTTGCAT 
181 

TTTAACTAAATTATGTTAAACTACTGTTATGCGATGAGTCGATATGTGGTTTTACCACTA 
241 

TTGCGCAGGGAGATTATAAACGCAGGAGCGGATCTTGATAAGTTGTGTGAACCTTCTTGT 

301 

CACACTTGAAAAGGTGCCCTTAGCTTACTACTACTTGTAATTTCTTACAAATTGTGGTAA 
361 

GTAGCTGAAAAGCAAAAAAGAAAGAACCAGTTTGGTTCTTTCTTTTTTGCATAAATAAGT 

421 

CACAATTTCCTTCTTAAAATTATGTCTTTACTTAACTTTAATTGAATATGCTACCATCAC 
481 

ATTCTTTGTAAAATTTTTAAATAATCTAGTTTCTGATGGTTTAGATGAAGTATTAAAAAT 
541 

ATACTATTACCTCATTGTAAATCTTAATGTTAGTATGACTATCTATCATGCTTTATAATA 
601 

TTAAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTAATTTTAAACATACA 
661 

AATT/^TAATAAATTGCAACTAAATAATAAATTATCTTGACATAACTTATAAAATGTTTT 
721 

AATATATAATCTAAATAAAAGTAATAATAAAATGACTTTTAAAATTTAAAAAAAGT AAGG 
781 RBS 
AGA AAATTAATTGTTCAATAAAATAGGTTTTAGAACTTGGAAATCAGGAAAGCTTTGGCT 
841 MFNKIGFRTWKSGKLWL 

TTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTCCTGTATCTGCTATGGA 
YMGVLGSTIILGSSPVSAMD 
901 I ^ Repeat 1 (SEQID21) 

TAGTGTTGGAAATCAAAGTCAGGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAA 
SVGNQSQGNVLERRQRDAEN 

961 ► Repeat 2 (SEQ ID 22) Repeat 3 (SEQ ID 23) , ^ 

CAGAAGCCAAbGCAATGTTCTAGAGCGTCGTCAACGCGATGTTGAGAATAAGAGCCAAGG 
RSQGNVLERRQRDVENKSQG 

1021 Repeat 4 (SEQ ID 24) | ► 

CAATGTTTTAGAGCGTCGTCAACGTGATGCGGAAAACAAGAGCCAAGGCAATGTTTTAGA 
NVLERRQRDAENKSQGNVLE 
1081 Repeat 5 (SEQ ID 25) | ► 

GCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGCAATGTTCTAGAGCGTCGTCAACG 
RRQRDAENRSQGNVLERRQR 
1141 Repeat 6 (SEQ ID 26) | ► 

TGATGCAGAAAACAGAAGCCAAGGCAATGTTCTAGAGCGTCGTCAACGCGATGCAGAi\AA 
DAENRSQGNVLERRQRDAEN 
1201 ^ Repeat 7 (SEQ ID 27) Repeat 8 (SEQ [D 28) ^ 

CAGAAGCCAAbGTAATGTTCTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGG 
RSQGNVLERRQRDAENRSQG 
1261 Repeat 9 (SEQ ID 29) | ► 

TAATGTTCTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGTAATGTTCTAGA 
NVLERRQRDAENRSQGNVLE 
1321 Repeat 1 0 (SEQ ID 30) | ► 

GCGTCGTCAACGCGATGTTGAGAATAAGAGCCAAGGCAATGTTTTAGAGCGTCGTCAACG 
RRQRDVENKSQGNVLERRQR 
1381 Repeat U (SEQ ID 31) | ^ 

TGATGCGGAAAACAAGAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAA 
DAENKSQGNVLERRQRDAEN 

1441 I ► Repeat 12 (SEQ ID 32) Repeat 1 3 (SEQ ID 33) | ► 

CAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGG 
RSQGNVLERRQRDAENRSQG 

1501 Repeat 14 (SEQ ID 34) | ^ 

CAATGTTCTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGCAATGTTCTAGA 

NVLERRQRDAENRSQGNVLE 
1551 Repeat 15 (SEQ ID 35) | ► 

GCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGCAATGTTCTAGAGCGTCGTCAACG 
RRQRDAENRSQGNVL ERRQR 

Repeat 16 (SEQ ID 36) | ► 

CGATGCAGAAAACAGAAGCCAA(3GTAATGTTCTAGAGCGTCGTCAACGTGATGCAGAAAA 

DAENRSQGNVLERRQRDAEN 
1531 I ► Repeat 1 7 (SEQ ID 37) Repeat 1 8 (SEQ ID 38) | ► 

CAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAT^CAGAAGCCAAGG 
RSQGNVLERRQRDAENRSQG 

1741 Repeat 19 (SEQ ID 39) | ► 

CAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGCAATGTTTTAGA 
NVLERRQRDAENRSQGNVLE 

1801 

GCGTCGTCAACGTGATGCGGAAAACAAGAGCCAAGTAGGTCAACTTATAGGGAAAAATCC 
RRQRDAENKSQVGQLI GKNP 
1861 

ACTTCTTTCAAAGTCAATTATATCTAGAGAAAATAATCACTCGAGTCAAGGTGACTCTAA 
LLSKSII SRENNHSSQGDSN 
1921 

CAAACAGTCATTCTCTAAAAAAGTATCTCAGGTTACTAATGTAGCTAATAGACCGATGTT 
KQSFSKKVSQVTNVANRPML 
1981 

AACTAATAATTCTAGAACAATTTCAGTGATAAATA7VATTACCTAAAACAGGTGATGATCA 
TNNSRTI SVINK L P K T G D D Q 
2041 
AAATGTCATTTTTAAACTTGTAGGTTTTGGTTTAATTTTGTTAACAAGTCGCTGCGGTTT 
NVIFKLVGFGLILLTSRCGL 
2101 

GAGACGCAATGAAAATTAAGTATAATCAATCATTTAGTAACTATATATAATGATATATGC 
R R N E N * 



AATCAATAAAAAGGAATCGGATACGAGATTCCTTTTTATAATTAGGTTGGTTAGGGTGAC 
2221 

TTTTTTCATTTGGCTATTCTTGAAAGTTTATAAAAATGTAGTTATAATAGTCACATTAAA 
2281 

ATGTTTTGAAAATATTGATGAACAACATCAACAAATAGAGGTCATTATATGGGATATACC 
2341 

GTTGCTATCGTAGGTGCTACAGGTGCCGTAGGAACACAAATGATTCGTCAATTAGAACAA 
2401 

TCGAATTTACCAATAGAACAAGTGAAACTTTTATCATCAAGTCGCTCAGCAGGTAAAATT 
2461 

TTACATTTTAAAGATGAGGCTATACGTGTTGAAGAGACAACAAAAGAATCATTTTACGAT 
2521 

GTTGATATTGCCTTGTTTTCAGCTGGTGGATC 
1 

GCATAAATAAGTCACAATTTCCTTCTTAAAATTATGTCTTTACTTAACTTTAATTGAATA 
61 

TGCTACCATCACATTCTTTGTAA/^TTTTTAAATAATCTAGTTTCTGATGGTTTAGATGA 
121 

AGTATTAAAAATATACTATTACCTCATTGTAAATCTTAATGTTAGTATGACTATCTATCA 
181 

TGCTTTATAATATTAAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTAAT 
241 

TTTAAACATACAAATTAATAATAAATTGCAACTAAATAATAAATTATCTTGACATAACTT 

301 

ATAAAATGTTTTAATATATAATCTAAATAAAAGTAATAATAAAATGACTTTTAAAATTTA 
361 

AAAAAAGT AAGGAGAA AATTAATTGTTCAATAAAATAGGTTTTAGAACTTGGAAATCAGG 
421 RBS MFNKIGFRTWKSG 

AAAGCTTTGGCTTTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTTCTGT 
KLWLYMGVLGSTIILGSSSV 

4 81 Repeat 1 (SEQ ID 40) j ^ 

ATCTGCTATGGATAGTGTTGGAAATCAAAGTCAGGGCAATGTTTTAGAGCGTCGTCAACG 

SAMDSVGNQSQGNVLERRQR 
Repeat 2 (SEQID41) p— ► 

CGATGCAGAAAACAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAA 

DAENRSQGNVLERRQRDAEN 
gQIL I ► Repeat 3 (SEQ ID 42) Repeat 4 (SEQ ID 43) | ► 

CAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGG 
RSQGNVLERRQRDAENRSQG 
661 Repeats (SEQ ID 44) | ^ 

TAATGTTCTAGAGCGTCGTCAACGCGATGTTGAAAATAAAAGCCAAGGCAATGTTTTAGA 

NVLERRQRDVENKSQGNVLE 

Repeat 6 (SEQ ID 45) | ► 

GCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGTAATGTTCTAGAGCGTCGTCAACG 

RRQRDAENRSQGNVLERRQR 
781 Repeat? (SEQ ID 46) | ^ 

CGATGTTGAAAATAAAAGCCAAGGC^TGTTTTAGAGCGTCGTCAACGTGATGCAGAAAA 
DVENKSQGNVLERRQRDAEN 

I ► Repeats (SEQ ID 47) Repeat 9 (SEQ ID 48) | ^ 

CAGAAGCCAAGGTAATGTTCTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAACiG 

RSQGNVLERRQRDAENRSQG 
901 Repeat 10 (SEQ ID 49) | ^ 

CAATGTTTTAGAGCGTCGTCAACGCGATGCAGAAAACAGAAGCCAAGGCAATGTTCTAGA 
NVLERRQRDAENRSQGNVLE 
961 Repeat 11 (SEQ ID 50) | ► 

GCGTCGTCAACGTGATGCTGAAAACAAAAGCCAAdGCAATGTTTTAGAGCGTCGTCAACG 
RRQRDAENKSQGNVLERRQR 
1021 Repeat 12 (SEQ ID 51) j ^ 

TGATGCAGAAAACAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCTGAAAA 
DAENRSQGNVLERRQRDAEN 

1081 I ^ Repeat 13 (SEQ ID 52) Repeat 14 (SEQ ID 53) j ^ 

CAGAAGCCAAbGCAATGTTTTAGAGCGTCGTCAACGCGATGCAGAAAACAGAAGCCAA(iG 
RSQGNVLERRQRDAENRSQG 

11^1 Repeat 15 (SEQ ID 54) | ^ 

TAATGTTCTAGAGCGTCGTCAACGTGATGCGGAAAACAAGAGCCAAGGCAATGTTTTAGA 
NVLERRQRDAENKSQGNVLE 
12 01 Repeat 16 (SEQ ID 55) | ^ 

GCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGCAATGTTTTAGAGCGTCGTCAACG 
RRQRDAENRSQGNVLERRQR 
1261 Repeat 17 (SEQ ID 56) | ^ 

CGATGTTGAGAATAAGAGCCAAGGCAATGTTTTAGAGCGTCGTCAACGTGATGCGGAAAA 
DVENKSQGNVLERRQRDAEN 
1321 

CAAGAGCCAAGTAGGTCAACTTATAGGGAAAT^TCCACTTCTTTCAAAGTCAATTATATC 
KSQVGQLIGKNPLLSKSI IS 
1381 

TAGAGATU^TAATCACTCTAGTCAAGGTGACTCTAACAAACAGTCATTCTCTAA/VAAAGT 
RENNHSSQGDSNKQSFSKKV 
1441 

ATCTCAGGTTACTAATGTAGCTAATAGACCGATGTTAACTAATAATTCTAGAACAATTTC 
SQVTNVANRPMLTNNSRT I S 
1501 

AGTGATAAATAAATTACCTAAAACAGGTGATGATCAAAATGTCATTTTTAAACTTGTAGG 
V I N K L P K T G DDQNVI FKLVG 
1561 

TTTTGGTTTAATTTTGTTAACAAGTCGCTGCGGTTTGAGACGCAATGAAAATTAAGTATA 
FGLILLTSRCGLRRNEN* 

1621 ^ 

ATCT^TCATTTAGTAACTATATATAATGATATATGCAATCAATAAAAAGGAATCGGATAC 



GAGATTCCTTTTTATAATTAGGTTGGTTAGGGTGACTTTTTTCATTTGGCTATTCTTGAA 
1741 1761 1781 

AGTTTATAAAAATGTAGTATAATAGTCACATTAAAATGTTTTGAAAATATTGATGAACAA 
1801 

CATCAACAAATAGAGGTCAT 
1 

GCATAAATAAGTCACT^TTTCCTTCTAAAAATTATGTCTTTACTTAACTTTAATTGAATA 
61 

TGCTACCATCACATTCTTTGTAAAATTTTTAAATAATCTAGTTTCTGATGGTTTAGATGA 

121 

AGTATTAAAAATATACTATTATCTCATTGTAAATCCTAATGTTAGTATGACTATCTATCA 
181 

TGTTTTATAATATTGAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTAAT 
241 

TTTA/^CATACAAATTAATAATAAATTGCAATTAAATAACAAATTACCTTGACATAAATT 

301 

ATAAAATGTTTTAATATATATAATCTAAATAAAAATAATAATAAAATGACTTTTAAAATT 
361 

TAAAAAAAG TAAGGAGAA AATTAATTGTTC?IATAAAATAGGTTTTAGAACTTGGAAATCA 
421 RBS MFNKIGFR T W K S 

GGAAAGCTTTGGCTTTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTCCT 

GKLWLYMGVLGSTIILGSSP 
4 81 Repeat! (SEQ ID 57) | ^ 

GTATCTGCTATGGATAGTGTTGGAAATCAAAGTCAAGGTAATGTTCTAGAGCGTCGTCAA 

VSAMDSVGNQSQGNVLERRQ 
Repeat 2 (SEQ ID 58) ^ ^ 

CGTGATGCGGATAACAAGAGCCAAGGCAATGTTCTAGAACGTCGTCAACGCGATGTAGAA 

RDADNKSQGNVLERRQRDVE 
601 j ► Repeats (SEQID59) 

AACAGAAGCCAAGGCAATGTTCTAGAGCGTCGTCAACGCGATGCGGATAACAAGAGCCAA 

NRSQGNVLERRQRDADNKSQ 
1 ^ Repeal 4 (SEQ ID 60) Repeats (SEQ ID 61) i ^ 

GGCAATGTTTTAGAGCGCCGCCAACGCGATGCAGAAAACAAAAGTCAGQGCAATGTTCTA 

GNVLERRQRDAENKSQGNVL 
721 Repeat 6 (SEQ ID 62) | ^ 

GAACGTCGTCAACGTGATGTTGAGAATAAGAGCCAAGGCAATGTTCTAGAGCGTCGCCAA 

ERRQRDVENKSQGNVLERRQ 
781 Repeat? (SEQ ID 63) j ^ 

CGTGATGCAGAAAACAAAAGTCAGdGTAATGTTCTAGAGCGTCGTCAACGCGATGCAGAT 

RDAENKSQGNVLERRQRDAD 
841 I ► Repeats (SEQ ID 64) 

AACAAGAGCCAAGGTAATGTTCTAGT^CGTCGTCAACGCGATGTGGAAAACAAAAGTCAG 

NKSQGNVLERRQRDVENKSQ 
I ^ Repeat 9 (SEQ ID 65) Repeat 10 (SEQ ID 66) j ^ 

GGCAATGTTCTAGAACGTCGTCAACGTGATGTTGAGAATAAGAGCCAAGGCAATGTTCTA 

GNVLERRQRDVENKSQGNVL 
g^2. Repeat 1 1 (SEQ ID 67) | ^ 

GAGCGTCGCCAACGTGATGCAGA?WUVCAAAAGTCAGGGTAATGTTCTAGAGCGTCGTCAA 
ERRQRDAENKSQGNVLE RRQ 



Fig. 4-1 
1021 Repeat 12 (SEQ ID 68) 



CGCGATGCAGATAACAAGAGCCAAGGTAATGTTCTAGAACGTCGTCAACGCGATGTGGAA 
RDADNKSQGNVLERRQRDVE 



AACAAAAGTCAGGGCAATGTTCTAGAGCGTCGCCAACGTGATGTTGAGAACAAGAGCCAA 

NKSQGNVLERRQRDVENKSQ 
1141 

GTAGGTCAACTTATAGGGAAAAATCCACTTCTTTCAAAGTCAACTATATCTAGAGAAAAT 

VGQLIGKNPLLSKSTISREN 
1201 

AATCACTCTAGTCAAGGTGACTCTAACAAACAGTCATTCTCTAAAAAAGTATCTCAGGTT 

NHSSQGDSNKQSFSKKVSQV 
1261 

ACTAATGTAGCTAATAGACCAATGTTAACTAATAATTCTAGAACAATTTCAGTGATAAAT 

TNVANRPMLTNNSRTISVIN 
1321 

AAATTACCTAAAACAGGTGATGATCAAAATGTCATTTTTAAACTTGTAGGTTTTGGTTTA 

K L P K T G DDQNVIFKLVGFGL 
1381 

ATTTTGTTAACAAGTCGCTGCGGTTTGAGACGCAATGAAAATTAAGTATAATCAATCATT 
ILLTSRCGLRRNEN* 



TTATAATTAGGTTGGTTAGGGTGACTTTTTCATTTGGCTATTCTTGAi\AGTTTATA7^AAA 
1561 

TGTAGTATAATAGTCACATTAAAATGTTTTGAAAATATTGATGAACAACATCAACAAATA 
1621 

GAGGTCAT 



1081 Repeat 13 (SEQ ID 69) 

Fig. 4-2 
1 

GCATAAATAAGTCACCAATTTCCCTTCTTAAAATTATGTCTTTACTTAACTTTAATTGAA 
61 

TATGCTACCATCACATTCTTTGTAAAATTTTTAAATAATCTAGTTTCTGATGGTTTAGAT 
121 

GAAGTATTAAAAATATACTATTACCTCATTGTAAATCTTAATGTTAGTATGACTATCTAT 
181 

CATGCTTTATAATATTAAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTA 
241 

ATTTTAAACATACAAATTAATAATAAATTGCAACTAAATAATAAATTATCTTGACATAAC 
301 

TTATAAAATGTTTTAATATATAATCTAAATAAAAGTAATAATAAAATGACTTTTAAAATT 
361 

TAAAAAAAGT AAGGAGAA AATTAATTGTTCA ATAAAATAGGTTCTA GAACTTGGAAATCA 
421 RBS M F N K J G F R T W K S 

GGAAAGCTTTGGCTTTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTCCT 

GKLWLYMGVLGSTIILGSSP 
481 Repeat I (SEQID70) j ^ 

GTATCTGCTATGGATAGTGTTGGAAATCAAAGTCAGGGCAATGTTTTAGAGCGTCGTCAA 
VSAMDSVGNQSQGNVLERRQ 

541 Repeat 2 (SEQ1D71) | ^ 

CGCGATGCAGAAAACAGAAGCCAAGGTAATGTTCTAGAGCGTCGTCAACGCGATGCAGAA 
RDAENRSQGNVLERRQRDAE 

I ^ Repeats (SEQID72) 

AACAGAAGCCAAGGTAATGTTCTAGAGCGTCGTCAACGTGATGCGGAAAACAAGAGCCAA 

NRSQGNVLERRQRDAENKSQ 
661 

GTAGGTCAACTTATAGGGAAAAATCCACTTCTTTCAAAGTCAATTATATCTAGAGAAAAT 

VGQLIGKNPLLSKSI ISREN 
721 

AATCACTCTAGTCAAGGTGACTCTAACAAACAGTCATTCTCTAAAAAAGTATCTCAGGTT 

NHSSQGDSNKQSFSKKVSQV 
781 

ACTAATGTAGCTAATAGACCGATGTTAACTAATAATTCTAGAACAATTTCAGTGATAAAT 

TNVANRPMLTNNSRTI SVIN 
841 

AAATTACCTAAAACAGGTGATGATCAAAATGTCATTTTTAAACTTGTAGGTTTTGGTTTA 

K L P K T G DDQNVI FKLVGFGL 
901 

ATTTTGTTAACAAGTCGCTGCGGTTTGAGACGCAATGAAAATTAAGTATAATCAATCATT 
ILLTSRCGLRRNEN* 

961 ^ ^ 

TAGTAACTATATATAATGATATATGCAATCAATAAAAAGGAATCGGATACGAGATTCCTT 



TTTATAATTAGGTTGGTTAGGGTGACTTTTTTCATTTGGCTATTCTTGAAAGTTTATAAA 
1081 

AATGTAGTATAATAGTCACATTAAAATGTTTTGAAAATATTGATGAACAACATCAACAAA 
1141 

TAGAGGTCAT 



Fig. 5 
1 

GCATAAATAAGTCACAATTTCCTTCTTAAAATTATGTCTTTACTTAACTTTAATTGAATA 
61 

TGCTACCATCACATTCTTTGTAAAATTTTTAAATAATCTAGTTTCTGATGGTTTAGATGA 
121 

AGTATTAAAAATATACTATTACCTCATTGTAAATCTTAATGTTAGTATGACTATCTATCA 
181 

TGCTTTATAATATTAAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTAAT 
241 

TTTAAACATACAAATTAATAATAAATTGCAACTAAATAATAAATTATCTTGACATAACTT 

301 

ATAAAATGTTTTAATATATAATCTAAATAAAAGTAATAATAAAATGACTTTTAAAATTTA 
361 

AAAAAAGTAAGGAGAAAATTAATTGTTCAATAAAATAGGTTTTAGAACTTGGAAATCAGG 
421 RBS MFNKIGFRTWKSG 
AAAGCTTTGGCTTTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTCCTGT 
KLWLYMGVLGSTIILGSSPV 
4 81 Repeat 1 (SEQ ID 73) i ^ 

ATCTGCTATGGATAGTGTTGGAAATCAAAGCCAAGGCAATGTTCTAGAGCGTCGTCAACG 
SAMDSVGNQSQGNVLERRQR 

Repeat 2 (SEQ ID 74) j ^ 

CGATGCAGAAAACAGAAGCCAAGGTAATGTTTTAGAACGTCGTCAACGCGATGTTGAGAA 
DAENRSQGNVLERRQRDVEN 

6 01 I ► Repeats (SEQ ID 75) Repeat 4 (SEQ ID 76) j ^ 

CAAGAGCCAAGGTAATGTTTTAGAGCGTCGCCAACGTGATGCGGAAAACAAAAGTCAGGG 
KSQGNVLERRQRDAENKSQG 

Repeats (SEQ ID 77) , ^ 

CAATGTTTTAGAGCGTCGTCAACGTGATGCAGAAAACAGAAGCCAAGGTAATGTTCTAGA 
NVLERRQRDAENRSQGNVLE 
721 Repeat 6 (SEQ ID 78) j ^ 

GCGTCGTCAACGCGATGTTGAGAATAAGAGCCAAGGCAATGTTCTAGAGCGTCGTCAACG 
RRQRDVENKSQGNVLERRQR 
781 Repeat 7 (SEQ ID 79) i ^ 

CGATGTTGAGAATAAGAGCCAAGGTAATGTTCTAGAGCGTCGTCAACGCGATGTTGAGAA 
DVENKSQGNVLERRQRDVEN 
341 j ^ Repeats (SEQ ID 80) Repeat 9 (SEQ ID 81) j ^ 

TAAGAGCCAAGGTAATGTTCTAGAGCGTCGTCAACGTGATGCGGAAAACAAGAGCCAAGG 
KSQGNVLERRQRDAENKSQG 

901 Repeat 10 (SEQ ID 82) | ► 

CAATGTTCTAGAGCGTCGTCAACGCGATGCAGAAAACAGAAGCCAAGGTAATGTTTTAGA 
NVLERRQRDAENRSQGNVLE 

961 

GCGTCGCCAACATGATGTTGAGAATAAGAGTCAAGTAGGTCAACTTATAGGGAAAAATCC 
RRQHDVENKSQVGQLIGKNP 
1021 

ACTTTTTTCAAAGTCAACTGTATCTAGAGAAAATAATCACTCTAGTCAAGGTGACTCTAA 
LFSKSTVSRENNHSSQGDSN 
1081 



Fig. 6-1 
CAAACAGTCATTCTCTAAAAAAGTATCTCAGGTTACTAATGTAGCTAATAGACCGATGTT 
KQSFS KKVSQVTNVANRPML 
1141 

AACTAATAATTCTAGAACAATTTCAGTGATAAATAAATTACCTAAAACAGGTGATGATCA 
TNNSRTISVINK L P K T G D D Q 

■ 1201 '. 

AAATGTCATTTTTAAACTTGTAGGTTTTGGTTTAATTTTATTAACAAGTCTCTGCGGTTT 

NVIFKLVGFGLILLTSLCGL 
1261 

GAGACGCAATGAAAATTAAGTATAATCAACCATTTAGTAACTATTATAATGATATATGCA 
R R N E N * 

1321 ^ ^ 

ATCAATAAAAAAGGAATCGAATACGAGATTCCTTTTTATAATTAGGTTGGTTAGGGTGAC 
1381 

TTTTTTCATTTGGCTATTCTTGAAAGTTTATAAAAATGTAGTATAATAGTCACATTAAAA 
1441 

TGTTTTGAAAATATTGATGAACAACATCATCAAATAGAGGTCAT 
1 

GCATAAATAAGTCACAATTTCCTTCTAAAAATTATGTCTTTACTTAACTTTAATTGAATA 
61 

TGCTACCATCACATTCTTTGTT^AAATTTTTAAATAACCTAGTTTCTGATGGTTTAGATGA 
121 

AGTATTAAAAATATACTATTATCTCATTGTAAATCCTAATGTTAGTATGACTATCTATCA 
181 

TGTTTTATAATATTGAAGGAAAATTTAAAAATATCATGTTTTAGATATCAACTATTTAAT 
241 

TTTAAACATACAAATTAATAATAAATTGCAATTAAATAACAAATTACCTTGACATAAATT 
301 

ATAAAATGATTTAATATATATAATCTAAATAAAAATAATAATAAAATGACTTTTAAAATT 
361 

TAAAAAAAGT AAGGAGAA AATTAATTGTTCAATAAAATAGGTTTTAGAACTTGGAAATCA 
421 RBS MFNKIGFRTWKS 

GGAAAGCTTTGGCTTTATATGGGAGTGCTAGGATCAACTATTATTTTAGGATCAAGTCCT 
GKLWLYMGVLGSTIILGSSP 

Repeat 1 (SEQ ID 83) | ► 

GTATCTGCTATGGATAGTGTTGGAAATCAAAGTCAAGGTAATGTTCTAGAGCGTCGCCAA 

VSAMDSVGNQSQGNVLERRQ 
541 Repeat 2 (SEQ ID 84) | ^ 

CGTGATGCGGATAACAAGAGCCAAGGTAATGTTTTAGAGCGTCGCCAACGTGATGCAGAT 
RDADNKSQGNVLERRQRDAD 

r ^ Repeats (SEQ ID 85) 

AACAAAAGTCAGCSGCAATGTTCTAGAACGTCGCCAACGTGATGTTGATAACAAGAGCCAA 

NKSQGNVLERRQRDVDNKSQ 
i ^ Repeat 4 (SEQ ID 86) Repeat 5 (SEQ ID 87) 



GGTT^CGTTCTAGAGCGTCGCCAACGCGATGCTGATAACAAGAGCCAAGGTAATGTTTTA 
GNVLERRQRDADNKSQGNVL 

"7 21 Repeat 6 (SEQ ID 88) r ^ 

GAGCGCCGCCAACGCGATGCAGATAACAAAAGTCAAGGTAATGTTCTAGAGCGTCGCCAA 

ERRQRDADNKSQGNVLERRQ 
781 Repeat? (SEQ ID 89) | ^ 

CGCGATGTTGATAACAAGAGCCAGGGTAATGTTTTAGAGCGTCGCCAACGCGATGCAGAT 

RDVDNKSQGNVLERRQRDAD 
841 ^ Repeats (SEQ ID 90) 

AACAT^AAGTCAGGGTAATGTTTTAGAGCGTCGCCAACGCGATGTTGATAACAAAAGCCAA 

NKSQGNVLERRQRDVDNKSQ 
I ^ Repeat 9 (SEQ ID 91) Repeat 10 (SEQ ID 92) | ^ 

GGTAATGTTTTAGAGCGTCGCCAACGTGATGCTGATAACAAAAGTCAGGGCAATGTTCTA 
GNVLERRQRDADNKSQGNVL 

961 Repeat 11 (SEQ ID 93) | ^ 

GAGCGTCGCCAACGTGATGCGGATAACAAAAGCCAAGGTAATGTTCTAGAGCGTCGCCAA 
ERRQRDADNKSQGNVLERRQ 

1021 Repeat 1 2 (SEQ ID 94) | ^ 

CGCGATGCGGATAACAAAAGTCAGGGCAATGTTTTAGAGCGTCGCCAACGTGATGCTGAT 
RDADNKSQGNVLERRQRDAD 
1081 | ► Repeat 13 (SEQ ID 95) 

AACAAAAGTCAAGGTAATGTTCTAGAGCGTCGCCAACGCGATGCAGATAACAAAAGCCAA 

NKSQGNVLERRQRDADNKSQ 
I ^ Repeat 14 (SEQID96) Repeat 15 (SEQ ID 97) j ^ 

GGTAATGTTCTAGAGCGTCGCCAACGCGATGCTGATAACAAAAGTCAAGGTAATGTTCTA 

GNVLERRQRDADNKSQGNVL 
]^201 Repeat 16 (SEQ ID 98) | ^ 

GAGCGTCGCCAACGTGATGCTGATAACAAGAGCCAAGGCAATGTTCTTGAGCGTCGTCAA 

ERRQRDADNKSQGNVLERRQ 
12 61 Repeat 17 (SEQ ID 99) | ^ 

CGCGATGTCGATAACAAAAGTCAGGGTAATGTTTTAGAGCGTCGCCAACGTGATGCGGAT 

RDVDNKSQGNVLERRQRDAD 
1221 I ^ ^^^^ 

AACAAGAGTCAAGGTAATGTTTTAGAGCGTCGCCAACGCGATGCGGATAACAAGAGCCAA 

NKSQGNVLERRQRDADNKSQ 

I 1^ Repeat 19 (SEQ ID 101) Repeat 20 (SEQ ID 102) i ^ 

GGTAATGTTTTAGAGCGTCGCCAACGCGATGCGGATAACAAGAGTCAA(3GTAATGTTTTA 

GNVLERRQRDADNKSQGNVL 
144 3^ Repeat 21 (SEQ ID 103) j ^ 

GAGCGTCGCCAACGCGATGCGGATAACAAGAGCCAAGGTAATGTTTTAGAGCGTCGCCAA 

ERRQRDADNKSQGNVLERRQ 
1501 Repeat 22 (SEQ ID 104) | ^ 

CGCGATGCAGATAACAAAAGTCAAGGTAATGTTTTAGAGCGTCGCCAACGCGATGCTGAT 

RDADNKSQGNVLERRQRDAD 
l^gl I ► Repeat 23 (SEQ ID 1 05) 

AACAAGAGCCAAGGTAATGTTTTAGAGCGTCGTCAACGTGATGCAGATAACAAAAGTCAG 

NKSQGNVLERRQRDADNKSQ 
I ^ Repeat 24 (SEQ ID 106) Repeat 25 (SEQ ID 107) | ^ 

GGCAATGTTTTAGAGCGTCGTCT^CGTGATGCGGATAACAAGAGCCAAGGTAATGTTTTA 
GNVLERRQRDADNKSQGNVL 

1 ^ S 1 Repeat 26 (SEQ ID 1 08) , ^ 

GAGCGTCGCCAACGTGATGCGGATAACAAGAGCCAGGGCAATGTTCTAGAACGTCGTCAA 

ERRQRDADNKSQGNVLERRQ 
1741 Repeat 27 (SEQ ID 1 09) j ^ 

CGTGATGCGGATAACAAGAGCCAAGGTAACGTTTTAGAGCGTCGCCAACGTGATGCGGAT 
RDADNKSQGNVLERRQRDAD 

1801 ^ Repeat 28 (SEQ ID 1 10) 

AACAAGAGCCAGCSGCAATGTTTTAGAGCGCCGCCAACGCGATGCAGATAACAAAAGTCAA 

NKSQGNVLERRQRDADNKSQ 
I ^ Repeat 29 (SEQ ID 11 1) Repeat 30 (SEQ ID 112) | ^ 

GGTAATGTTCTAGAGCGTCGCCAACGCGATGCAGATAACAAGAGCCAGGGTAATGTTCTA 

GNVLERRQRDADNKSQGNVL 
1921 

GAGCGTCGCCAACGCGATGCGGAAAACAAAAGTCAAGTAGGTCAACTTATAGGGAAAAAT 

ERRQRDAENKSQVGQLIGKN 
1981 
CCACTTTTTTCAAAGTCAACTGTATCTAGAGAAAATAATCACTCTAGTCAAGGTGACTCT 

PLFSKSTVSRENNHSSQGDS 
2041 

AACAAACAGTCATTCTCTAAAAAAATATCTCAGGTTACTAATGTAGCTAATGGACCGATG 

NKQSFSKKI SQVTNVANGPM 
2101 

TTAACTAATAATTCTAGAACAATTTCAGTGATAAATAAATTACCTAAAACAGGTGATGAT 

LTNNSRTISVINK L P K T G D D 
2161 

CAAAATGTCATTTTTAAACTTGTAGGTTTTGGTTTAATTTTGTTAACAAGTCTCTGCGGT 

QNVI FKLVGFGLILLTSLCG 
2221 

TTGAGACGCAATGAAAATTAAGTATAATCAACCATTTAGTAACTATTATAATGATATATG 

L R R N E N * 
2281 ^ ^ 

CAATCAATAAAAAAGGAATCGAATACGAGATTCCTTTTTATAATTAGGTTGGTTAGGGTG 
2341 2361 2381 

ACTTTTTTCATTTGGCTATTCTTGAAAGTTTATAAAAATGTAGTATAATAGTCACATTAA 
2401 2421 2441 

AATGTTTTGAAAATATTGATGi\ACAACATCATC7^AATAGAGGTCAT 
GNVLERRQRDAENRSQ (SeqID 204) 
GLSQNRDVRENQRARE (SeqID 205) 
GNVLERRQRDAENRSQ 
GLSQNRDVRENQRARE 
ANVLERRQRDAENRSQ (SeqID 206) 
GAVLERRQRDAENRSQ (SeqID 207) 
GNALERRQRDAENRSQ (SeqID 208) 
GNVAERRQRDAENRSQ (SeqID 209) 
GNVLARRQRDAENRSQ (SeqID 210) 
GNVLEARQRDAENRSQ (SeqID 211) 
GNVLERAQRDAENRSQ (SeqID 212) 
GNVLERRARDAENRSQ (SeqID 213) 
GNVLERRQADAENRSQ (SeqID 214) 
GNVLERRQRAAENRSQ (SeqID 215) 
GNVLERRQRDAENRSQ (SeqID 216) 
GNVLERRQRDAANRSQ (SeqID 217) 
GNVLERRQRDAEARSQ (SeqID 218) 
GNVLERRQRDAENASQ (SeqID 219) 
GNVLERRQRDAENRAQ (SeqID 220) 
GNVLERRQRDAENRSA (SeqID 221) 
GNVLERRQRDAENRSQ 
GLSQNRDVRENQRARE
GNVLERRQRDAENRSQ
GLSQNRDVRENQRARE
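
The single-substitution peptides listed above (SeqID 206 to SeqID 221) follow the pattern of an alanine scan of the 16-residue repeat peptide GNVLERRQRDAENRSQ (SeqID 204): each residue in turn is replaced by alanine. The following is a minimal sketch, assuming that is how the list was derived; the function name `alanine_scan` is hypothetical and serves only for illustration.

```python
def alanine_scan(peptide: str) -> list[str]:
    """Return one variant per position, with that residue replaced by alanine (A)."""
    return [peptide[:i] + "A" + peptide[i + 1:] for i in range(len(peptide))]


# Parent repeat peptide as listed above (SeqID 204).
parent = "GNVLERRQRDAENRSQ"
variants = alanine_scan(parent)

for position, variant in enumerate(variants, start=1):
    # A position that already carries alanine yields the unchanged parent peptide.
    note = " (identical to parent)" if variant == parent else ""
    print(f"{position:2d} {variant}{note}")
```

Position 11 of the parent peptide is already alanine, which would explain why the corresponding entry in the list (SeqID 216) is identical to SeqID 204.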
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1 

ATTTTTAAGCAATATTTTAAAACATAAAT^AAAGAAAT^TCAACTACTTAAGCTAATTGAA 
61 

GTATTTCTAAGATAATAA7U^AATAAGATTATCAAATi\AAAAGAAAAATCATTCAAAAATT 
121 

GGGAAAAAACTTTAAAATTCCATACCTTATAATAAGAAATTATTGATATCATAATAAGTG 
181 

ATAGTTTGTATATTCTAGGATATTCTGTATCTGATCTTAGATTTAGAAACGACATTTCGG 
241 

CACAAT AGGAG TTGTAAAATGAGAAAATACCAAAAATTTTCTAAAATATTGACGTTAAGT 
301 RBS MRKYQKFSKILTLS 

CTTTTTTGTTTGTCGCAAATACCGCTTAATACCAATGTTTTAGGGGAAAGTACCGTACCG 
LFCLSQIPLNTNVLGESTVP 

361 

GAAAATGGTGCTAAAGGAAAGTTAGTTGTTAAAAAGACAGATGACCAGAACAAACCACTT 

ENGAKGKLVVKKTDDQNKPL
421 

TCAAAAGCTACCTTTGTTTTAAAAACTACTGCTCATCCAGAAAGTAAAATAGAAAAAGTA 

SKATFVLKTTAHPESKI EKV 
481 

ACTGCTGAGCTAACAGGTGAAGCTACTTTTGATAATCTCATACCTGGAGATTATACTTTA 

TAELTGEATFDNLI PGDYTL 
541 

TCAGAAGAAACAGCGCCCGAAGGTTATAAAAAGACTAACCAGACTTGGCAAGTTAAGGTT 

SEETAPEGY KKTNQTWQVKV 
601 

GAGAGTAATGGAAAAACTACGATACAAAATAGTGGTGATAAAAATTCCACAATTGGACAA 

ESNGKTTIQNSGDKNSTIGQ 
661 

AATCACGAAGAACTAGATAAGCAGTATCCCCCCACAGGAATTTATGAAGATACAAAGGAA 

NHEELDKQYPPTGIYEDTKE 
721 

TCTTATAAACTTGAGCATGTTAAAGGTTCAGTTCCAAATGGAAAGTCAGAGGCAAAAGCA 
SYKLEHVKGSVPNGKSEAKA 

781 

GTTAACCCATATTCAAGTGAAGGTGAGCATATAAGAGAAATTCCAGAGGGAACATTATCT 

VNPYSSEGEHIREIPEGTLS 
841 

AAACGTATTTCAGAAGTAGGTGATTTAGCTCATAATAAATATAAAATTGAGTTAACTGTC 
KRISEVGDLAHNKYKIELTV 

901 

AGTGGAAAAACCATAGTAAAACCAGTGGACAAACAAAAGCCGTTAGATGTTGTCTTCGTA 
SGKTIVKPVDKQKPLDVVFV 

961 

CTCGATAATTCTAACTCAATGAATAACGATGGCCCAAATTTTCAAAGGCATAATAAAGCC 

LDNSNSMNNDGPNFQRHNKA 
1021 
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AAGAAAGCTGCCGAAGCTCTTGGGACCGCAGTAAAAGATATTTTAGGAGCAAACAGTGAT 

KKAAEALGTAVKDI LGANSD 
1081 

AATAGGGTTGCATTAGTTACCTATGGTTCAGATATTTTTGATGGTAGGAGTGTAGATGTC 
NRVALVTYGSDIFDGRSVDV 

1141 

GTAAAAGGATTTAAAGAAGATGATAAATATTATGGCCTTCAAACTAAGTTCACAATTCAG 

VKGFKEDDKYYGLQTKFTIQ 
1201 

ACAGAGAATTATAGTCATAAACAATTAACAAATAATGCTGAAGAGATTATAAAAAGGATT 

TENYSHKQLTNNAEEI I KRI 
1261 

CCTACAGAAGCTCCTAGAGCTAAATGGGGATCAACTACAAACGGACTTACTCCAGAGCAA 

PTEAPRAKWGSTTNGLTPEQ 
1321 

CAAAAGCAGTACTATCTTAGTAAAGTAGGGGAAACATTTACTATGAAAGCCTTCATGGAG 

QKQYYLSKVGETFTMKAFME 
1381 

GCAGATGATATTTTGAGTCAAGTAGATCGAAATAGTCAAAAAATTATTGTTCATATAACT 

ADDILSQVDRNSQKIIVHIT 
1441 

GATGGTGTTCCAACAAGATCATATGCTATTAATAATTTTAAATTGGGTGCATCATATGAA 
DGVPTRSYAINNFKLGASYE 

1501 

AGCCAATTTGAACAAATGAAAAAAAATGGATATCTAAATAAAAGTAATTTTCTACTTACT 

SQFEQMKKNGYLNKSNFLLT 
1561 

GATAAGCCCGAGGATATAAAAGGAAATGGGGAGAGTTACTTTTTGTTTCCCTTAGATAGT 

DKPEDIKGNGESYFLFPLDS 
1621 

TATCAAACACAGATAATCTCTGGAAACTTACAAAAACTTCATTATTTAGATTTAAATCTT 

YQTQI ISGNLQKLHYLDLNL 
1681 

AATTACCCTAAAGGTACAATTTATCGAAATGGACCAGTAAGAGAACATGGAACACCAACC 

NYPKGTIYRNGPVREHGTPT 
1741 

AAACTTTATATAAATAGTTTAAAACAGAAAAATTATGACATCTTTAATTTTGGTATAGAT

KLYINSLKQKNYDIFNFGID 
1801 

ATATCTGCTTTTAGACAAGTTTATAATGAGGATTATAAGAAAAATCAAGATGGTACTTTT 

I SAFRQVYNEDYKKNQDGTF 
1861 

CAAAAATTGAAAGAGGAAGCTTTTGAACTTTCAGATGGGGAAATAACAGAACTAATGAAG 

QKLKEEAFELSDGEITELMK 
1921 

TCATTCTCTTCTAAACCTGAGTATTATACCCCGATAGTAACTTCATCCGATGCATCTAAC 

SFSSKPEYYTPIVTSSDASN 
1981 
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AATGAAATTTTATCTAAAATTCAGCAACAATTTGAAAAGGTTTTAACAAAAGAAAACTCA 

NEILSKIQQQFEKVLTKENS
2041 

ATTGTTAATGGAACTATAGAAGATCCTATGGGTGACAAAATCAATTTACAGCTTGGCAAC 

IVNGTIEDPMGDKINLQLGN 
2101 

GGACAAACATTGCAACCAAGTGATTATACTTTACAGGGAAATGATGGAAGTATAATGAAA

GQTLQPSDYTLQGNDGSIMK 
2161 

GATAGCATTGCAACTGGTGGGCCTAATAATGATGGTGGAATACTTAAAGGGGTTAAATTA 

DSIATGGPNNDGGILKGVKL 
2221 

GAATACATCAAAAATAAACTCTACGTTAGAGGTTTGAACTTAGGGGAGGGACAAAAAGTA 

EYIKNKLYVRGLNLGEGQKV 
2281 

ACACTCACATATGATGTGAAACTAGATGACAGTTTTATAAGTAACAAATTCTATGACACT 

TLTYDVKLDDSFI SNKFYDT 
2341 

AATGGTAGAACAACATTGAATCCTAAATCAGAGGATCCTAATACACTTAGAGATTTTCCA 

NGRTTLNPKSEDPNTLRDFP 
2401 

ATCCCTAAAATTCGTGATGTGAGAGAATATCCTACAATAACGATTAAAAACGAGAAGAAG 

IPKIRDVREYPTITIKNEKK 
2461 

TTAGGTGAAATTGAATTTACAAAAGTTGATAAAGATAATAATAAGTTGCTTCTCAAAGGA 

LGEIEFTKVDKDNNKLLLKG 
2521 

GCTACGTTTGAACTTCAAGAATTTAATGAAGATTATAAACTTTATTTACCAATAAAAAAT 

ATFELQEFNEDYKLYLPI KN 
2581 

AATAATTCAAAAGTAGTGACGGGAGAAAACGGCAAAATTTCTTACAAAGATTTGAAAGAT 

NNSKVVTGENGKI SYKDLKD 
2641 

GGCAAATATCAGTTAATAGAAGCAGTTTCGCCGAAGGATTATCAAAAAATTACTAATAAA

GKYQLIEAVSPKDYQKITNK 
2701 

CCAATTTTAACTTTTGAAGTTGTTAAAGGATCGATACAAAATATAATAGCTGTTAATAAA 

PILTFEVVKGSIQNIIAVNK
2761 

CAGATTTCTGAATATCATGAGGAAGGTGACAAGCATTTAATTACCAACACGCATATTCCA 

QISEYHEEGDKHLITNTHI P 
2821 

CCAAAAGGAATTATTCCGATGACAGGTGGGAAAGGAATTCTATCTTTCATTTTAATAGGT 

P K G I I P M T G GKGILSFILIG 
2881 

GGATCTATGATGTCTATTGCAGGTGGAATTTATATTTGGAAAAGATATAAGAAATCTAGT 

GSMMSIAGGIYIWKRYKKSS 
2941 
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GATATATCTAGAGAAAAAGATTAAGAATCATGTGTTTTAGTATTCTTAATTTU^TTAAATA 

DISREKD* 
3001 

TAATTCGAAAGGAGTGGTGCTGCGGTAATATTATAATCCGTATATTATTATCTATGTTGA 
3061 

TTAACTAGAATAAGAAGGAGATAGAAATGAAAAAAATCAACAAATGTCTTACAGTGTTCT 
3121 RBS MKKINKCLTVF

CGACACTGCTATTGATCTTAACGTCACTATTCTCAGTTGCACCAGCGTTTGCGGACGACG 
STLLLILTSLFSVAPAFADD
3181 

TAACAACTGATACTGTGACCTTGCACAAGATTGTCATGCCACAAGCTGCATTTGATAACT 

VTTDTVTLHKIVMPQAAFDN 

3241 

TTACTGAAGGTACAAAAGGTAAGAATGATAGCGATTATGTTGGTAAACAAATTAATGACC 

FTEGTKGKNDSDYVGKQIND 

3301 

TTAAATCTTATTTTGGCTCAACCGATGCTAAAGAAATTAAGGGTGCTTTCTTTGTTTTCA 

LKSYFGSTDAKEIKGAFFVF 

3361 

AAAATGAAACTGGTACAAAATTCATTACTGAAAATGGTAAGGAAGTCGATACTTTGGAAG 
KNETGTKF ITENGKEVDTLE 

3421 

CTAAAGATGCTGAAGGTGGTGCTGTTCTTTCAGGGTTAACAAAAGACACTGGTTTTGCTT 

AKDAEGGAVLSGLTKDTGFA 

3481 

TTAACACTGCTAAGTTAAAAGGAACTTACCAAATCGTTGAATTGAAAGAAAAATCAAACT 

FNTAKLKGTYQIVELKEKSN 

3541 

ACGATAACAACGGTTCTATCTTGGCTGATTCAAAAGCAGTTCCAGTTAAAATCACTCTGC 
YDNNGS ILADSKAVPVKITL 
3601 

CATTGGTAAACAACCAAGGTGTTGTTAAAGATGCTCACATTTATCCAAAGAATACTGAAA 
PLVNNQGVVKDAHI YPKNTE 
3661 

CAAAACCACAAGTAGATAAGAACTTTGCAGATAAAGATCTTGATTATACTGACAACCGAA 

TKPQVDKNFADKDLDYTDNR 

3721 

AAGACAAAGGTGTTGTCTCAGCGACAGTTGGTGACAAAAAAGAATACATAGTTGGAACAA 
KDKGVVSATVGDK KEYIVGT 
3781 

AAATTCTTAAAGGCTCAGACTATAAGAAACTGGTTTGGACTGATAGCATGACTAAAGGTT
KILKGSDYKKLVWTDSMTKG
3841 

TGACGTTCAACAACAACGTTAAAGTAACATTGGATGGTAAAGATTTTCCTGTTTTAAACT 

LTFNNNVKVTLDGKDFPVLN 

3901 

ACAAACTCGTAACAGATGACCAAGGTTTCCGTCTTGCCTTGAATGCAACAGGTCTTGCAG 

YKLVTDDQGFRLALNATGLA 

3961 
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CAGTAGCAGCTGCTGCAAAAGACAAAGATGTTGAAATCAAGATCACTTACTCAGCTACGG 
AV AAAAKDKDVEIKITYSAT 
4021 

TGAACGGCTCCACTACTGTTGAAGTTCCAGAAACCAATGATGTTAAATTGGACTATGGTA 

VNGSTTVEVPETNDVKLDYG 

4081 

ATAACCCAACGGAAGAAAGTGAACCACAAGAAGGTACTCCAGCTAACCAAGAAATTAAAG
NNPTEESEPQEGTPANQE I K 
4141 

TCATTAAAGACTGGGCAGTAGATGGTACAATTACTGATGTTAATGTTGCAGTTAAAGCTA 
VI KDWAVDGTI TDVNVAVKA 
4201 

TCTTTACCTTGCAAGAAAAACAAACGGATGGTACATGGGTGAACGTTGCTTCACACGAAG 
I FTLQEKQTDGTWVNVASHE 

4261 

CAACAAAACCATCACGCTTTGAACATACTTTCACAGGTTTGGATAATACTAAAACTTACC 

ATKPSRFEHTFTGLDNTKTY 

4321 

GCGTTGTCGAACGTGTTAGCGGCTACACTCCAGAATATGTATCATTTAAAAATGGTGTTG 

RVVERVSGYTPEYVSFKNGV 

4381 

TGACTATCAAGAACAACAAAAACTCAAATGATCCAACTCCAATCAACCCATCAGAACCAA 

VTIKNNKNSNDPTPINPSEP 

4441 

AAGTGGTGACTTATGGACGTAAATTTGTGAAAACAAATCAAGCTAACACTGAACGCTTGG 

KVVTYGRKFVKTNQANTERL 

4501 

CAGGAGCTACCTTCCTTGTTAAGAAAGAAGGAAAATACTTGGCACGTAAAGCAGGTGCAG 

AGATFLVKKEGKYLARKAGA 

4561 

CAACTGCTGAAGCAAAGGCAGCTGTAAAAACTGCTAAACTAGCATTGGATGAAGCTGTTA 

ATAEAKAAVKTAKLALDEAV 

4621 

AAGCTTATAACGACTTGACTAAAGAAAAACAAGAAGGCCAAGAAGGTAAAACAGCATTGG 

KAYNDLTKEKQEGQEGKTAL 

4681 

CTACTGTTGATCAAAAACAAAAAGCTTACAATGACGCTTTTGTTAAAGCTAACTACTCAT 

ATVDQKQKAYNDAFVKANYS 

4741 

ATGAATGGGTTGCAGATAAAAAGGCTGATAATGTTGTTAAATTGATCTCTAACGCCGGTG 
YEWVADKKADNVVKLI SNAG 
4801 

GTCAATTTGAAATTACTGGTTTGGATAAAGGCACTTATAGCTTGGAAGAAACTCAAGCAC 

GQFEITGLDKGTYSLEETQA 

4861 

CAGCAGGTTATGCGACATTGTCAGGTGATGTAAACTTTGAAGTAACTGCCACATCATATA 

PAGYATLSGDVNFEVTATSY 

4921 
GCAAAGGGGCTACAACTGACATCGCATATGATAAAGGATCTGTAAAAAAAGATGCCCAAC

SKGATTDIAYDKGSVKKDAQ 

4981 

AAGTTCAAAACAAAAAAGTAACCATCCCACAAACAGGTGGTATTGGTACAATTCTTTTCA 
QVQNKKVT I P Q T G G I G T I L F 
5041 

CAATTATTGGTTTAAGCATTATGCTTGGAGCAGTAGTTGTCATGAAAAAACGTCAATCAG 

TIIGIiSIMLGAVVVMKKRQS 

5101 

AGGAAGCTTAAGGCTAGTCTTTGATGGTGTATAAGCACAGTTAAAGCTGTGCTTATGATC

E E A * 

5161 

TAAGGGTATTTCAGTAGAAGTACTCTTAGATCATAAGCAAGAGCCATTATTTAGGAGATG 
5221 

ACGTGAAGACTAAAAATATCT^CAAT^AAAACTAAAAAGAAGAAGTCAAATCTTCCTTTTA 
5281 

TCATTCTTTTTCTAATAGGTCTATCTATTTTATTGTATCCAGTGGTATCACGTTTTTACT 
5341 

ATACGATAGAATCTAATAATCAAACACAGGATTTTGAGAGAG 
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1 

GCTCATGATAATTTATAGAACATTTATAAAATCTTATAATAAACTGGTTAAGTATAGGAA 
61 

ATACTGCATATTTCTTGAAAATATGGTGTATATTGTGT^TATU^TGATGACCAAGTTAAT 
121 

TGAATTTTCCTATCGAAAAATTTTTCAAAAAAAATAATTTCACGCTCAAATCATTTGATT 
181 

GTCAAATAAATAGAGCCTTTATAAAAATATTATATAAGTATAAAATGTA/^AAAAATAAAA 
241 

AAATGATATTTTTATTTGATTCAAATGTATTTAATAAAAATACAAAGTTTCTAAAAAAGT 
301 

AAAAATTCCATCTCAATAAACAGCGTTAGTTATTATAACCGAACATTATTGTCCTTAAT^ 
361 

CATT/U\AACAAAAACAAAAGTTCGTAATTTAATTAATTTGTCATGTTACTAATCTTATGC 
421 

TAATATATTATCTCGTGATAAGTTTTTGATGTA7VAAATTATCATGAAAAAGAAA AGAGAG 
481 RBS 
ATGGAAATGAAAAAACAATTTTTAAAATCAGCAGCGATTCTATCGCTAGCAGTAACAGCA 
541 MKKQFLKSAAILSLAVTA
GTATCTACAAGTCAGCCGGTAGCCGGGATAACTAAAGATTATAATAACCGAAATGAAAAA 

VSTSQPVAGITKDYNNRNEK
601 

GTAAAAAAGTATTTACAAGAAAATAATTTCGGTCATAAAATAGCGTATGGATGGAAAAAT 

VKKYLQENNFGHKIAYGWKN 
661 

AAAGTAGAATTTGATTTTCGTTATTTATTGGATACTGCTAAATATTTAGTAAATAAAGAA 

KVEFDFRYLLDTAKYLVNKE 
721 

GAATTTCAAGATCCTTTATATAATGATGCGCGCGAAGAATTGATAAGTTTTATTTTTCCT 

EFQDPLYNDAREELISFIFP 
781 

TATGAGAAATTTTTAATTAACAATCGTGACATAACTAAATTAACAGTTAATCAGTATGAA 

YEKFLINNRDITKLTVNQYE 
841 

GCGATTGTGAATAGAATGAGTGTTGCTTTACAAAAATTTTCAAAGAATATTTTTGAGAAA 

AIVNRMSVALQKFSKNIFEK 
901 

CAGAAAGTAAATAAAGATTTAATCCCTATTGCGTTTTGGATTGAGAAAAGTTACAGAACT 

QKVNKDLI PIAFWIEKSYRT 
961 

GTTGGAACGAATGAAATCGCCGCTTCTGTAGGCATTCAAGGAGGATTTTATCAAAACTTC 

VGTNEIAASVGIQGGFYQNF 
1021 

CATGATTATTATAATTATTCATATCTATTAAATTCTTTATGGCATGAAGGAAATGTAAAA 

HDYYNYSYLLNS LWHEGNVK 
1081 

GAAGTAGTTAAGGATTATGAAAACACTATTCGTCAAATACTATCTAAAAAGCATGAGATT 

EVVKDYENTIRQILSKKHEI 
1141 

Fig. 17-1 
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GAAAAAATTCTTAATCAGAGCACTTCTGATATCTCTATAGATGATGATGATTACGAAAAA 
EKILNQSTSDI S I DDDDYE K 

1201 

GGAAATAAAGAATTGCTAAGGGAAAAATTAAATATTATTCTAAATCTTTCAAAGAGAGAT 

GNKELLREKLNIILNLSKRD
1261 

TACAGAGTAACTCCATACTATGAAGTGAATAAACTACATACAGGGCTTATTTTATTGGAG 

YRVTPYYEVNKLHTGL ILLE 
1321 

GATGTCCCTAATTTAAAGATTGCTAAGGATAAGTTGTTCTCATTAGAGAATTCTTTAAAG 

DVPNLKIAKDKLFSLENSLK 
1381 

GAATACAAAGGAGAGAAAGTTAATTATGAGGAACTAAGATTCAATACGGAACCTTTAACT 

EYKGEKVNYEELRFNTEPLT
1441 

AGTTACTTAGAAAATAAAGAAAAATTTTTAGTCCCCAATATTCCATATAAAAATAAATTA 

SYLENKEKFLVPNI PYKNKL 
1501 

ATTTTAAGGGAAGAAGATAAATATAGTTTTGAAGATGATGAAGAAGAGTTTGGAAATGAA 

ILREEDKYSFEDDEEEFGNE 
1561 

CTTCTAAGTTACAATAAGCTTAAGAATGAAGTTTTACCTGTTAATATTACAACTTCTACT 

LLSYNKLKNEVLPVNI TTST 
1621 

ATATTAAAACCGTTTGAACAGAAGAAAATTGTGGAAGATTTTAATCCTTATTCTAATTTA 

ILKPFEQKKIVEDFNPYSNL
1681 

GACAATTTAGAAATAAAAAAAATAAGGTTGAATGGCTCCCTVAAAACAAAAAGTAGAACAG 

DNLEI KKIRLNGSQKQKVEQ 
1741 

GAAAAAACTAAATCGCCAACTCCTCAAAAAGAGACTGTGAAAGAACAAACTGAGCAAAAA 

EKTKSPTPQKETVKEQTEQK 
1801 

GTATCTGGAAATACTCAAGAGGTAGAAAAGAAATCTGAAACTGTGGCAACTTCACAACAA 

VSGNTQEVEKKSETVATSQQ 
1861 

AGTTCAGTTGCGCAAACTTCTGTCCAACAGCCGGCTCCGGTTCAATCAGTTGTTCAAGAA 

SSVAQTSVQQPAPVQSVVQE 
1921 

TCCAAAGCTTCTCAAGAGGAGATTAATGCAGCACACGATGCTATTTCGGCGTATAAATCA 

SKASQEEINAAHDAI SAYKS 
1981 

ACAGTCAATATTGCTAATACAGCCGGTGTAACAACTGCGGAAATGACCACGCTCATTAAT 

TVNIANTAGVTTAEMTTLIN 
2041 

ACTCAAACTTCTAATCTTTCTGATGTTGAGAAAGCTTTAGGAAATAATAAGGTTAATAAT

TQTSNLSDVEKALGNNKVNN 
2101 

GGTGCAGTCAATGTATTGAGAGAAGATACAGCTCGTCTTGAGAATATGATTTGGAATCGT 

GAVNVLREDTARLENMIWNR
2161 Fig. 17-2 
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GCTTACCAAGCTATTGAAGAATTCAACGTCGCTCGTAATACTTATAATAACCAAATCAAG 
AYQAI EE FNVARNTYNNQ I K 

2221 

ACAGAAACAGTTCCAGTTGATAATGATATTGAAGCTATTTTAGCAGGTTCTCAAGCTAAA 

TETVPVDNDIEAILAGSQAK 
2281 

ATTAGCCATTTGGACAATCGTATCGGAGCGCGCCACATGGATCAAGCTTTTGTAGCTAGT 

I SHLDNRIGARHMDQAF VAS 
2341 

TTATTAGAAGTTACTGAGATGAGTAAATCAATCTCATCGCGTATAAAAGAGTAGACACTG 

LLEVTEMSKSISSRIKE* 
2401 

CTATCAAGGCGATCTTAAACTTTTGTATTAAACTAACCTAAAAGAT AGAAAGA GACTAAT 
2461 RBS 
ATGAAAAAAATAACAACTTTAATCTTAGCTAGTAGCTTATTACTAGTTGCAACGACATCG 
MKKITTLILASSLLLVATTS 

2521 

GTTAAAGCTGATGATAACTTTGAAATGCCAACGCGTTATGTTAAAATGAGTGAAAAATCA

VKADDNFEMPTRYVKMSEKS
2581 

AAAGCATTTTATCAAAGACTACAAGAAAAACAACGTAAGGCACATACTACTGTGAAGACT 

KAFYQRLQEKQRKAHTTVKT 
2641 

TTTAATAATTCAGAAATAAGGCATCAACTACCTCTTAAACAAGAAAAGGCTAGAAATGAT 

FNNSEIRHQLPLKQEKARND 
2701 

ATCTACAATTTAGGCATTCTTATTTCTCAGGAGTCTAAAGGGTTCATCCAACGTATTGAT 

IYNLGILISQESKGFIQRID
2761 

AATGCCTATTCTTTGGAAAATGTCTCAGATATTGTTAATGAAGCTCAGGCTTTGTATAAA 

NAYSLENVSDIVNEAQALYK 
2821 

CGTAACTATGATTTATTTGAAAAAATCAAATCTACACGTGATAAGGTTCAAGTCTTACTT 

RNYDL FEKI KSTRDKVQVLL 
2881 

GCATCGCATCAAGATAATACAGACTTAAAAAACTTTTATGCTGAGTTAGATGATATGTAT 

ASHQDNTDLKNFYAELDDMY 
2941 

GAACATGTTTATCTCAATGAAAGTAGAGTGGAGGCGATAAACAGAAATATCCAAAAATAT 

EHVYLNESRVEAINRNIQKY 
3001 

AATTAGTTTCTAAACTAACAAACATTCCTAAATATAAGATATTAT^CCCTACTTATTGAT 

N * 

3061 

TAGTGAGTAGGGTTTTACTGTTTTAAATAGCTTTCTGCTCAGAATGTAAGCCTTGTCATT 
3121 

TCAAAGGAACTATGTTATTATTCTTAAGTAAATTAAATAGGACATTTGGGGTGCGTAACA 
3181 

GCTGAGATTATACCCATTGA 



Fig. 17-3 
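
In Figs. 17-1 to 17-3 each 60-nucleotide line of an open reading frame is paired with its 20-residue translation. The following is a minimal sketch of how such a line can be translated and checked, assuming the standard genetic code and a nucleotide line that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical and not taken from the source.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; unreadable codons are reported as 'X'."""
    dna = dna.upper().replace(" ", "")
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


# First line of the second open reading frame in Fig. 17-3 (position 2461).
line = "ATGAAAAAAATAACAACTTTAATCTTAGCTAGTAGCTTATTACTAGTTGCAACGACATCG"
print(translate(line))
```

A line whose reading frame is offset against the 60-nucleotide layout has to be joined with its neighbouring lines before translation; a codon damaged in reproduction then shows up as `X` instead of silently producing a wrong residue.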




Fig. 18 
<120> Nucleic acids coding for adhesion factors of group B streptococcus,
adhesion factors of group B streptococcus and further uses thereof

<130> I 10003 PCX

<160> 258

<170> PatentIn version 3.1

<210> 1
<211> 1329
<212> DNA
<213> Streptococcus agalactiae

<400> 1
ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60
gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 120
aatcaaagtc agggcaatgt tttagagcgt cgtcaacgtg atgcagaaaa cagaagccaa 180
ggcaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaagg caatgtttta 240
gagcgtcgtc aacgtgatgc ggaaaacaag agccaaggca atgttttaga gcgtcgtcaa 300
cgtgatgcag aaaacagaag ccaaggcaat gttctagagc gtcgtcaacg tgatgcagaa 360
aacagaagcc aaggcaatgt tctagagcgt cgtcaacgcg atgcagaaaa cagaagccaa 420
ggtaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg taatgttcta 480
gagcgtcgtc aacgtgatgc agaaaacaga agccaaggta atgttctaga gcgtcgtcaa 540
cgcgatgttg agaataagag ccaaggcaat gttttagagc gtcgtcaacg tgatgcggaa 600
aacaagagcc aaggcaatgt tttagagcgt cgtcaacgtg atgcagaaaa cagaagccaa 660
ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg caatgttcta 720
gagcgtcgtc aacgtgatgc agaaaacaga agccaaggca atgttctaga gcgtcgtcaa 780
cgtgatgcag aaaacagaag ccaaggcaat gttctagagc gtcgtcaacg cgatgcagaa 840
aacagaagcc aaggtaatgt tctagagcgt cgtcaacgtg atgcagaaaa cagaagccaa 900
ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg caatgtttta 960
gagcgtcgtc aacgtgatgc agaaaacaga agccaaggca atgttttaga gcgtcgtcaa 1020
cgtgatgcgg aaaacaagag ccaagtaggt caacttatag ggaaaaatcc acttctttca 1080
aagtcaatta tatctagaga aaataatcac tcgagtcaag gtgactctaa caaacagtca 1140
ttctctaaaa aagtatctca ggttactaat gtagctaata gaccgatgtt aactaataat 1200
tctagaacaa tttcagtgat aaataaatta cctaaaacag gtgatgatca aaatgtcatt 1260
tttaaacttg taggttttgg tttaattttg ttaacaagtc gctgcggttt gagacgcaat 1320
gaaaattaa 1329

<210> 2
<211> 1233
<212> DNA
<213> Streptococcus agalactiae

<400> 2
ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60
gtgctaggat caactattat tttaggatca agttctgtat ctgctatgga tagtgttgga 120
aatcaaagtc agggcaatgt tttagagcgt cgtcaacgcg atgcagaaaa cagaagccaa 180
ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg caatgtttta 240
gagcgtcgtc aacgtgatgc agaaaacaga agccaaggta atgttctaga gcgtcgtcaa 300
cgcgatgttg aaaataaaag ccaaggcaat gttttagagc gtcgtcaacg tgatgcagaa 360
aacagaagcc aaggtaatgt tctagagcgt cgtcaacgcg atgttgaaaa taaaagccaa 420
ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaagg taatgttcta 480
gagcgtcgtc aacgtgatgc agaaaacaga agccaaggca atgttttaga gcgtcgtcaa 540
cgcgatgcag aaaacagaag ccaaggcaat gttctagagc gtcgtcaacg tgatgctgaa 600
aacaaaagcc aaggcaatgt tttagagcgt cgtcaacgtg atgcagaaaa cagaagccaa 660
ggcaatgttt tagagcgtcg tcaacgtgat gctgaaaaca gaagccaagg caatgtttta 720
gagcgtcgtc aacgcgatgc agaaaacaga agccaaggta atgttctaga gcgtcgtcaa 780
cgtgatgcgg aaaacaagag ccaaggcaat gttttagagc gtcgtcaacg tgatgcagaa 840
aacagaagcc aaggcaatgt tttagagcgt cgccaacgcg a ugageici ^ a a a ^ a T\ 900
ggcaatgttt tagagcgtcg tcaacgtgat gcggaaaaca agagccaagt aggtcaactt 960
atagggaaaa atccacttct ttcaaagtca attatatcta gagaaaataa tcactctagt 1020
caaggtgact ctaacaaaca gtcattctct aaaaaagtat ctcaggttac taatgtagct 1080
aatagaccga tgttaactaa taattctaga acaatttcag tgataaataa attacctaaa 1140
acaggtgatg atcaaaatgt catttttaaa cttgtaggtt 1200
agtcgctgcg gtttgagacg caatgaaaat taa 1233



<210> 3
<211> 1041
<212> DNA
<213> Streptococcus agalactiae

<400> 3
ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60
gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 120
aatcaaagtc aaggtaatgt tctagagcgt cgtcaacgtg atgcggataa caagagccaa 180
ggcaatgttc tagaacgtcg tcaacgcgat gtagaaaaca gaagccaagg caatgttcta 240
gagcgtcgtc aacgcgatgc ggataacaag agccaaggca atgttttaga gcgccgccaa 300
aataagagcc aaggcaatgt tctagagcgt cgccaacgtg atgcagaaaa caaaagtcag 420
ggtaatgttc tagagcgtcg tcaacgcgat gcagataaca agagccaagg taatgttcta 480
gaacgtcgtc aacgcgatgt ggaaaacaaa agtcagggca atgttctaga acgtcgtcaa 540
cgtgatgttg agaataagag ccaaggcaat gttctagagc gtcgccaacg tgatgcagaa 600
aacaaaagtc agggtaatgt tctagagcgt cgtcaacgcg atgcagataa caagagccaa 660
ggtaatgttc tagaacgtcg tcaacgcgat gtggaaaaca aaagtcaggg caatgttcta 720
gagcgtcgcc aacgtgatgt tgagaacaag agccaagtag gtcaacttat agggaaaaat 780
ccacttcttt caaagtcaac tatatctaga gaaaataatc actctagtca aggtgactct 840
aacaaacagt cattctctaa aaaagtatct caggttacta atgtagctaa tagaccaatg 900
ttaactaata attctagaac aatttcagtg ataaataaat tacctaaaac aggtgatgat 960
caaaatgtca tttttaaact tgtaggtttt ggtttaattt tgttaacaag tcgctgcggt 1020
ttgagacgca atgaaaatta a 1041

<210> 4
<211> 561
<212> DNA
<213> Streptococcus agalactiae

<400> 4
ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60
gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 120
aatcaaagtc agggcaatgt tttagagcgt cgtcaacgcg atgcagaaaa cagaagccaa 180
ggtaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaagg taatgttcta 240
gagcgtcgtc aacgtgatgc ggaaaacaag agccaagtag gtcaacttat agggaaaaat 300
ccacttcttt caaagtcaat tatatctaga gaaaataatc actctagtca aggtgactct 360
aacaaacagt cattctctaa aaaagtatct caggttacta atgtagctaa tagaccgatg 420
ttaactaata attctagaac aatttcagtg ataaataaat tacctaaaac aggtgatgat 480
caaaatgtca tttttaaact tgtaggtttt ggtttaattt tgttaacaag tcgctgcggt 540
ttgagacgca atgaaaatta a 561



<210> 5
<211> 897
<212> DNA
<213> Streptococcus agalactiae

<400> 5
ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60
gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 120
aatcaaagcc aaggcaatgt tctagagcgt cgtcaacgcg atgcagaaaa cagaagccaa 180
ggtaatgttt tagaacgtcg tcaacgcgat gttgagaaca agagccaagg taatgtttta 240
gagcgtcgcc aacgtgatgc ggaaaacaaa agtcagggca atgttttaga gcgtcgtcaa 300
cgtgatgcag aaaacagaag ccaaggtaat gttctagagc gtcgtcaacg cgatgttgag 360
aataagagcc aaggcaatgt tctagagcgt cgtcaacgcg atgttgagaa taagagccaa 420
ggtaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaagg taatgttcta 480
gagcgtcgtc aacgtgatgc ggaaaacaag agccaaggca atgttctaga gcgtcgtcaa 540
cgcgatgcag aaaacagaag ccaaggtaat gccutagagc gtcgccaaca ugatgccgag 600
aataagagtc aagtaggtca acttataggg aaaaatccac ttttttcaaa gtcaactgta 660
tctagagaaa ataatcactc tagtcaaggt gactctaaca aacagtcatt ctctaaaaaa 720
gtatctcagg ttactaatgt agctaataga ccgatgttaa ctaataattc tagaacaatt 780
tcagtgataa ataaattacc taaaacaggt gatgatcaaa atgtcatttt taaacttgta 840
ggttttggtt taattttatt aacaagtctc tgcggtttga gacgcaatga aaattaa 897



<210> 6 
<211> 1857 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 6 

ttgttcaata aaataggttt tagaacttgg aaatcaggaa agctttggct ttatatggga 60 

gtgctaggat caactattat tttaggatca agtcctgtat ctgctatgga tagtgttgga 120 

aatcaaagtc aaggtaatgt tctagagcgt cgccaacgtg atgcggataa caagagccaa 180 

ggtaatgttt tagagcgtcg ccaacgtgat gcagataaca aaagtcaggg caatgttcta 240 

gaacgtcgcc aacgtgatgt tgataacaag agccaaggta acgttctaga gcgtcgccaa 300 

cgcgatgctg ataacaagag ccaaggtaat gttttagagc gccgccaacg cgatgcagat 360 

aacaaaagtc aaggtaatgt tctagagcgt cgccaacgcg atgttgataa caagagccag 420 

ggtaatgttt tagagcgtcg ccaacgcgat gcagataaca aaagtcaggg taatgtttta 480 

gagcgtcgcc aacgcgatgt tgataacaaa agccaaggta atgttttaga gcgtcgccaa 540

cgtgatgctg ataacaaaag tcagggcaat gttctagagc gtcgccaacg tgatgcggat 600 
aacaaaagcc aaggtaatgt tctagagcgt cgccaacgcg atgcggataa caaaagtcag 660

ggcaatgttt tagagcgtcg ccaacgtgat gctgataaca aaagtcaagg taatgttcta 720

gagcgtcgcc aacgcgatgc agataacaaa agccaaggta atgttctaga gcgtcgccaa 780

cgcgatgctg ataacaaaag tcaaggtaat gttctagagc gtcgccaacg tgatgctgat 840

aacaagagcc aaggcaatgt tcttgagcgt cgtcaacgcg atgtcgataa caaaagtcag 900

ggtaatgttt tagagcgtcg ccaacgtgat gcggataaca agagtcaagg taatgtttta 960

gagcgtcgcc aacgcgatgc ggataacaag agccaaggta atgttttaga gcgtcgccaa 1020

cgcgatgcgg ataacaagag tcaaggtaat gttttagagc gtcgccaacg cgatgcggat 1080

aacaagagcc aaggtaatgt tttagagcgt cgccaacgcg atgcagataa caaaagtcaa 1140

ggtaatgttt tagagcgtcg ccaacgcgat gctgataaca agagccaagg taatgtttta 1200

gagcgtcgtc aacgtgatgc agataacaaa agtcagggca atgttttaga gcgtcgtcaa 1260

cgtgatgcgg ataacaagag ccaaggtaat gttttagagc gtcgccaacg tgatgcggat 1320

aacaagagcc agggcaatgt tctagaacgt cgtcaacgtg atgcggataa caagagccaa 1380

ggtaacgttt tagagcgtcg ccaacgtgat gcggataaca agagccaggg caatgtttta 1440

gagcgccgcc aacgcgatgc agataacaaa agtcaaggta atgttctaga gcgtcgccaa 1500

cgcgatgcag ataacaagag ccagggtaat gttctagagc gtcgccaacg cgatgcggaa 1560

aacaaaagtc aagtaggtca acttataggg aaaaatccac ttttttcaaa gtcaactgta 1620

tctagagaaa ataatcactc tagtcaaggt gactctaaca aacagtcatt ctctaaaaaa 1680

atatctcagg ttactaatgt agctaatgga ccgatgttaa ctaataattc tagaacaatt 1740

tcagtgataa ataaattacc taaaacaggt gatgatcaaa atgtcatttt taaacttgta 1800

ggttttggtt taattttgtt aacaagtctc tgcggtttga gacgcaatga aaattaa 1857




<210> 7 
<211> 2706 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 7 

atgagaaaat accaaaaatt ttctaaaata ttgacgttaa gtcttttttg tttgtcgcaa 60 
ataccgctta ataccaatgt tttaggggaa agtaccgtac cggaaaatgg tgctaaagga 120 
aagttagttg ttaaaaagac agatgaccag aacaaaccac tttcaaaagc tacctttgtt 180 
ttaaaaacta ctgctcatcc agaaagtaaa atagaaaaag taactgctga gctaacaggt 240 
gaagctactt ttgataatct catacctgga gattatactt tatcagaaga aacagcgccc 300 
gaaggttata aaaagactaa ccagacttgg caagttaagg ttgagagtaa tggaaaaact 360 
acgatacaaa atagtggtga taaaaattcc acaattggac aaaatcacga agaactagat 420
aagcagtatc cccccacagg aatttatgaa gatacaaagg aatcttataa acttgagcat 480
gttaaaggtt cagttccaaa tggaaagtca gaggcaaaag cagttaaccc atattcaagt 540
gaaggtgagc atataagaga aattccagag ggaacattat ctaaacgtat ttcagaagta 600
ggtgatttag ctcataataa atataaaatt gagttaactg tcagtggaaa aaccatagta 660
aaaccagtgg acaaacaaaa gccgttagat gttgtcttcg tactcgataa ttctaactca 720
atgaataacg atggcccaaa ttttcaaagg cataataaag ccaagaaagc tgccgaagct 780
cttgggaccg cagtaaaaga tattttagga gcaaacagtg ataatagggt tgcattagtt 840
acctatggtt cagatatttt tgatggtagg agtgtagatg tcgtaaaagg atttaaagaa 900
gatgataaat attatggcct tcaaactaag ttcacaattc agacagagaa ttatagtcat 960
aaacaattaa caaataatgc tgaagagatt ataaaaagga ttcctacaga agctcctaga 1020
gctaaatggg gatcaactac aaacggactt actccagagc aacaaaagca gtactatctt 1080
agtaaagtag gggaaacatt tactatgaaa gccttcatgg aggcagatga tattttgagt 1140
caagtagatc gaaatagtca aaaaattatt gttcatataa ctgatggtgt tccaacaaga 1200
tcatatgcta ttaataattt taaattgggt gcatcatatg aaagccaatt tgaacaaatg 1260
aaaaaaaatg gatatctaaa taaaagtaat tttctactta ctgataagcc cgaggatata 1320
aaaggaaatg gggagagtta ctttttgttt cccttagata gttatcaaac acagataatc 1380
tctggaaact tacaaaaact tcattattta gatttaaatc ttaattaccc taaaggtaca 1440
atttatcgaa atggaccagt aagagaacat ggaacaccaa ccaaacttta tataaatagt 1500
ttaaaacaga aaaattatga catctttaat tttggtatag atatatctgc ttttagacaa 1560
gtttataatg aggattataa gaaaaatcaa gatggtactt ttcaaaaatt gaaagaggaa 1620
gcttttgaac tttcagatgg ggaaataaca gaactaatga agtcattctc ttctaaacct 1680
gagtattata ccccgatagt aacttcatcc gatgcatcta acaatgaaat tttatctaaa 1740
attcagcaac aatttgaaaa ggttttaaca aaagaaaact caattgttaa tggaactata 1800
gaagatccta tgggtgacaa aatcaattta cagcttggca acggacaaac attgcaacca 1860
agtgattata ctttacaggg aaatgatgga agtataatga aagatagcat tgcaactggt 1920
gggcctaata atgatggtgg aatacttaaa ggggttaaat tagaatacat caaaaataaa 1980
ctctacgtta gaggtttgaa cttaggggag ggacaaaaag taacactcac atatgatgtg 2040
aaactagatg acagttttat aagtaacaaa ttctatgaca ctaatggtag aacaacattg 2100
aatcctaaat cagaggatcc taatacactt agagattttc caatccctaa aattcgtgat 2160
gtgagagaat atcctacaat aacgattaaa aacgagaaga agttaggtga aattgaattt 2220
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acaaaagttg ataaagataa taataagttg cttctcaaag gagctacgtt tgaacttcaa 2280 

gaatttaatg aagattataa actttattta ccaataaaaa ataataattc aaaagtagtg 2340 

acgggagaaa acggcaaaat ttcttacaaa gatttgaaag atggcaaata tcagttaata 2400 

gaagcagttt cgccgaagga ttatcaaaaa attactaata aaccaatttt aacttttgaa 2460 

gttgttaaag gatcgataca aaatataata gctgttaata aacagatttc tgaatatcat 2520 

gaggaaggtg acaagcattc aattaccaac acgcatattc caccaaaagg aattattccg 2580 

atgacaggtg ggaaaggaat tctatctttc attttaatag gtggatctat gatgtctatt 2640 

gcaggtggaa tttatatttg gaaaagatat aagaaatcta gtgatatatc tagagaaaaa 2700 

gattaa 2706 

<210> 8
<211> 2025
<212> DNA
<213> Streptococcus agalactiae

<400> 8
atgaaaaaaa tcaacaaatg tcttacagtg ttctcgacac tgctattgat cttaacgtca 60
ctattctcag ttgcaccagc gtttgcggac gacgtaacaa ctgatactgt gaccttgcac 120
aagattgtca tgccacaagc tgcatttgat aactttactg aaggtacaaa aggtaagaat 180
gatagcgatt atgttggtaa acaaattaat gaccttaaat cttattttgg ctcaaccgat 240
gctaaagaaa ttaagggtgc tttctttgtt ttcaaaaatg aaactggtac aaaattcatt 300
actgaaaatg gtaaggaagt cgatactttg gaagctaaag atgctgaagg tggtgctgtt 360
ctttcagggt taacaaaaga cactggtttt gcttttaaca ctgctaagtt aaaaggaact 420
taccaaatcg ttgaattgaa agaaaaatca aactacgata acaacggttc tatcttggct 480
gattcaaaag cagttccagt taaaatcact ctgccattgg taaacaacca aggtgttgtt 540
aaagatgctc acatttatcc aaagaatact gaaacaaaac cacaagtaga taagaacttt 600
gcagataaag atcttgatta tactgacaac cgaaaagaca aaggtgttgt ctcagcgaca 660
gttggtgaca aaaaagaata catagttgga acaaaaattc ttaaaggctc agactataag 720
aaactggttt ggactgatag catgactaaa ggtttgacgt tcaacaacaa cgttaaagta 780
acattggatg gtaaagattt tcctgtttta aactacaaac tcgtaacaga tgaccaaggt 840
ttccgtcttg ccttgaatgc aacaggtctt gcagcagtag cagctgctgc aaaagacaaa 900
gatgttgaaa tcaagatcac ttactcagct acggtgaacg gctccactac tgttgaagtt 960
ccagaaacca atgatgttaa attggactat ggtaataacc caacggaaga aagtgaacca 1020
caagaaggta ctccagctaa ccaagaaatt aaagtcatta aagactgggc agtagatggt 1080
acaattactg atgttaatgt tgcagttaaa gctatcttta ccttgcaaga aaaacaaacg 1140
gatggtacat gggtgaacgt tgcttcacac gaagcaacaa aaccatcacg ctttgaacat 1200
actttcacag gtttggataa tactaaaact taccgcgttg tcgaacgtgt tagcggctac 1260
actccagaat atgtatcatt taaaaatggt gttgtgacta tcaagaacaa caaaaactca 1320
aatgatccaa ctccaatcaa cccatcagaa ccaaaagtgg tgacttatgg acgtaaattt 1380
gtgaaaacaa atcaagctaa cactgaacgc ttggcaggag ctaccttcct tgttaagaaa 1440
gaaggaaaat acttggcacg taaagcaggt gcagcaactg ctgaagcaaa ggcagctgta 1500
aaaactgcta aactagcatt ggatgaagct gttaaagctt ataacgactt gactaaagaa 1560
aaacaagaag gccaagaagg taaaacagca ttggctactg ttgatcaaaa acaaaaagct 1620
tacaatgacg cttttgttaa agctaactac tcatatgaat gggttgcaga taaaaaggct 1680
gataatgttg ttaaattgat ctctaacgcc ggtggtcaat ttgaaattac tggtttggat 1740
aaaggcactt atagcttgga agaaactcaa gcaccagcag gttatgcgac attgtcaggt 1800
gatgtaaact ttgaagtaac tgccacatca tatagcaaag gggctacaac tgacatcgca 1860
tatgataaag gatctgtaaa aaaagatgcc caacaagttc aaaacaaaaa agtaaccatc 1920
ccacaaacag gtggtattgg tacaattctt ttcacaatta ttggtttaag cattatgctt 1980
ggagcagtag ttgtcatgaa aaaacgtcaa tcagaggaag cttaa 2025



<210> 9 
<211> 1908 
<212> DNA 

<213> Streptococcus agalactiae 










<400> 9 
atgaaaaaac aatttttaaa atcagcagcg attctatcgc tagcagtaac agcagtatct 60
acaagtcagc cggtagccgg gataactaaa gattataata accgaaatga aaaagtaaaa 120
aagtatttac aagaaaataa tttcggtcat aaaatagcgt atggatggaa aaataaagta 180
gaatttgatt ttcgttattt attggatact gctaaatatt tagtaaataa agaagaattt 240
caagatcctt tatataatga tgcgcgcgaa gaattgataa gttttatttt tccttatgag 300
aaatttttaa ttaacaatcg tgacataact aaattaacag ttaatcagta tgaagcgatt 360
gtgaatagaa tgagtgttgc tttacaaaaa ttttcaaaga atatttttga gaaacagaaa 420
gtaaataaag atttaatccc tattgcgttt tggattgaga aaagttacag aactgttgga 480
acgaatgaaa tcgccgcttc tgtaggcatt caaggaggat tttatcaaaa cttccatgat 540
tattataatt attcatatct attaaattct ttatggcatg aaggaaatgt aaaagaagta 600


gttaaggatt atgaaaacac tattcgtcaa atactatcta aaaagcatga gattgaaaaa 660
attcttaatc agagcacttc tgatatctct atagatgatg atgattacga aaaaggaaat 720
aaagaattgc taagggaaaa attaaatatt attctaaatc tttcaaagag agattacaga 780
gtaactccat actatgaagt gaataaacta catacagggc ttattttatt 840
cctaatttaa agattgctaa ggataagttg ttctcattag agaattcttt aaaggaatac 900
aaaggagaaa aagttaatta tgaggaacta agattcaata cggaacctct aactagttac 960
ttagaaaata aagaaaaatt tttagtcccc aatattccat ataaaaataa attaatttta 1020
agggaagaag ataaatatag ttttgaagat gatgaagaag agtttggaaa tgaacttcta 1080
agttacaata agcttaagaa tgaagtttta cctgttaata ttacaacttc tactatatta 1140
aaaccgtttg aacagaagaa aattgtggaa gattttaatc cttattctaa tttagacaat 1200
ttagaaataa aaaaaataag gttgaatggc tcccaaaaac aaaaagtaga acaggaaaaa 1260
actaaatcgc caactcctca aaaagagact gtgaaagaac aaactgagca aaaagtatct 1320
ggaaatactc aagaggtaga aaagaaatct gaaactgtgg caacttcaca acaaagttca 1380
gttgcgcaaa cttctgtcca acagccggcc ccggttcaat cagttgttca agaatccaaa 1440
gcttctcaag aggagattaa tgcagcacac gatgctattt cggcgtataa atcaacagtc 1500
aatattgcta atacagccgg tgtaacaact gcggaaatga ccacgctcat taatactcaa 1560
acttctaatc tttctgatgt tgagaaagct ttaggaaata ataaggttaa taatggtgca 1620
gtcaatgtat tgagagaaga tacagctcgt cttgagaata tgatttggaa tcgtgcttac 1680


caagctattg aagaattcaa cgtcgctcgt aatacttata ataaccaaat caagacagaa 1740
acagttccag ttgataatga tattgaagct attttagcag gttctcaagc taaaattagc 1800
catttggaca atcgtatcgg agcgcgccac atggatcaag cttttgtagc tagtttatta 1860
gaagttactg agatgagtaa atcaatctca tcgcgtataa aagagtag 1908



<210> 10 
<211> 546 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 10 

atgaaaaaaa taacaacttt aatcttagct agtagcttat tactagttgc aacgacatcg 60 
gttaaagctg atgataactt tgaaatgcca acgcgttatg ttaaaatgag tgaaaaatca 120 
aaagcatttt atcaaagact acaagaaaaa caacgtaagg cacatactac tgtgaagact 180 

atctacaatt taggcattct tatttctcag gagtctaaag ggttcatcca acgtattgat 300 
aatgcctatt ctttggaaaa tgtctcagat attgttaatg aagctcaggc tttgtataaa 360 
cgtaactatg atttatttga aaaaatcaaa tctacacgtg ataaggttca agtcttactt 420
gcatcgcatc aagataatac agacttaaaa aacttttatg ctgagttaga tgatatgtat 480
gaacatgttt atctcaatga aagtagagtg gaggcgataa acagaaatat ccaaaaatat 540
aattag 546 

<210> 11 
<211> 442 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 11 

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Pro
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
85 90 95

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
100 105 110

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
115 120 125

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
130 135 140

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
145 150 155 160

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
165 170 175

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
180 185 190

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
195 200 205

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
210 215 220

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
225 230 235 240

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
245 250 255

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
260 265 270

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
275 280 285

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
290 295 300

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
305 310 315 320

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
325 330 335

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Val Gly Gln Leu
340 345 350

Ile Gly Lys Asn Pro Leu Leu Ser Lys Ser Ile Ile Ser Arg Glu Asn
355 360 365

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
370 375 380

Val Ser Gln Val Thr Asn Val Ala Asn Arg Pro Met Leu Thr Asn Asn
385 390 395 400

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
405 410 415

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
420 425 430

Ser Arg Cys Gly Leu Arg Arg Asn Glu Asn
435 440



<210> 12 
<211> 410 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 12 

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Ser
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
85 90 95

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
100 105 110

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
115 120 125

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
130 135 140

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
145 150 155 160

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
165 170 175

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
180 185 190

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
195 200 205

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
210 215 220

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
225 230 235 240

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
245 250 255

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
260 265 270

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
275 280 285

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
290 295 300

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Val Gly Gln Leu
305 310 315 320

Ile Gly Lys Asn Pro Leu Leu Ser Lys Ser Ile Ile Ser Arg Glu Asn
325 330 335

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
340 345 350

Val Ser Gln Val Thr Asn Val Ala Asn Arg Pro Met Leu Thr Asn Asn
355 360 365

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
370 375 380

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
385 390 395 400

Ser Arg Cys Gly Leu Arg Arg Asn Glu Asn
405 410



<210> 13 

<211> 346 

<212> PRT 

<213> Streptococcus agalactiae 



<400> 13

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Pro
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Val Glu Asn Arg Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
85 90 95

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
100 105 110

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
115 120 125

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
130 135 140

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
145 150 155 160

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
165 170 175

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
180 185 190

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
195 200 205

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
210 215 220

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
225 230 235 240

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Val Gly Gln Leu
245 250 255

Ile Gly Lys Asn Pro Leu Leu Ser Lys Ser Thr Ile Ser Arg Glu Asn
260 265 270

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
275 280 285

Val Ser Gln Val Thr Asn Val Ala Asn Arg Pro Met Leu Thr Asn Asn
290 295 300

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
305 310 315 320

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
325 330 335

Ser Arg Cys Gly Leu Arg Arg Asn Glu Asn
340 345



<210> 14 
<211> 186 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 14 

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Pro
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Val Gly Gln Leu
85 90 95

Ile Gly Lys Asn Pro Leu Leu Ser Lys Ser Ile Ile Ser Arg Glu Asn
100 105 110

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
115 120 125

Val Ser Gln Val Thr Asn Val Ala Asn Arg Pro Met Leu Thr Asn Asn
130 135 140

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
145 150 155 160

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
165 170 175

Ser Arg Cys Gly Leu Arg Arg Asn Glu Asn
180 185

<210> 15 
<211> 298 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 15 

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Pro
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
85 90 95

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
100 105 110

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
115 120 125

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
130 135 140

Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln Gly Asn Val Leu
145 150 155 160

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Gly Asn Val Leu
165 170 175

Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln Gly Asn Val Leu
180 185 190

Glu Arg Arg Gln His Asp Val Glu Asn Lys Ser Gln Val Gly Gln Leu
195 200 205

Ile Gly Lys Asn Pro Leu Phe Ser Lys Ser Thr Val Ser Arg Glu Asn
210 215 220

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
225 230 235 240

Val Ser Gln Val Thr Asn Val Ala Asn Arg Pro Met Leu Thr Asn Asn
245 250 255

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
260 265 270

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
275 280 285

Ser Leu Cys Gly Leu Arg Arg Asn Glu Asn
290 295



<210> 16 
<211> 618 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 16 

Met Phe Asn Lys Ile Gly Phe Arg Thr Trp Lys Ser Gly Lys Leu Trp
1 5 10 15

Leu Tyr Met Gly Val Leu Gly Ser Thr Ile Ile Leu Gly Ser Ser Pro
20 25 30

Val Ser Ala Met Asp Ser Val Gly Asn Gln Ser Gln Gly Asn Val Leu
35 40 45

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
50 55 60

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
65 70 75 80

Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln Gly Asn Val Leu
85 90 95

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
100 105 110

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
115 120 125

Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln Gly Asn Val Leu
130 135 140

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
145 150 155 160

Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln Gly Asn Val Leu
165 170 175

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
180 185 190

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
195 200 205

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
210 215 220

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
225 230 235 240

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
245 250 255

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
260 265 270

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
275 280 285

Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln Gly Asn Val Leu
290 295 300

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
305 310 315 320

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
325 330 335

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
340 345 350

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
355 360 365

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
370 375 380

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
385 390 395 400

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
405 410 415

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
420 425 430

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
435 440 445

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
450 455 460

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
465 470 475 480

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
485 490 495

Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln Gly Asn Val Leu
500 505 510

Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln Val Gly Gln Leu
515 520 525

Ile Gly Lys Asn Pro Leu Phe Ser Lys Ser Thr Val Ser Arg Glu Asn
530 535 540

Asn His Ser Ser Gln Gly Asp Ser Asn Lys Gln Ser Phe Ser Lys Lys
545 550 555 560

Ile Ser Gln Val Thr Asn Val Ala Asn Gly Pro Met Leu Thr Asn Asn
565 570 575

Ser Arg Thr Ile Ser Val Ile Asn Lys Leu Pro Lys Thr Gly Asp Asp
580 585 590

Gln Asn Val Ile Phe Lys Leu Val Gly Phe Gly Leu Ile Leu Leu Thr
595 600 605

Ser Leu Cys Gly Leu Arg Arg Asn Glu Asn
610 615



<210> 17 
<211> 901 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 17 

Met Arg Lys Tyr Gln Lys Phe Ser Lys Ile Leu Thr Leu Ser Leu Phe
1 5 10 15

Cys Leu Ser Gln Ile Pro Leu Asn Thr Asn Val Leu Gly Glu Ser Thr
20 25 30

Val Pro Glu Asn Gly Ala Lys Gly Lys Leu Val Val Lys Lys Thr Asp
35 40 45

Asp Gln Asn Lys Pro Leu Ser Lys Ala Thr Phe Val Leu Lys Thr Thr
50 55 60

Ala His Pro Glu Ser Lys Ile Glu Lys Val Thr Ala Glu Leu Thr Gly
65 70 75 80

Glu Ala Thr Phe Asp Asn Leu Ile Pro Gly Asp Tyr Thr Leu Ser Glu
85 90 95

Glu Thr Ala Pro Glu Gly Tyr Lys Lys Thr Asn Gln Thr Trp Gln Val
100 105 110

Lys Val Glu Ser Asn Gly Lys Thr Thr Ile Gln Asn Ser Gly Asp Lys
115 120 125

Asn Ser Thr Ile Gly Gln Asn His Glu Glu Leu Asp Lys Gln Tyr Pro
130 135 140

Pro Thr Gly Ile Tyr Glu Asp Thr Lys Glu Ser Tyr Lys Leu Glu His
145 150 155 160

Val Lys Gly Ser Val Pro Asn Gly Lys Ser Glu Ala Lys Ala Val Asn
165 170 175

Pro Tyr Ser Ser Glu Gly Glu His Ile Arg Glu Ile Pro Glu Gly Thr
180 185 190

Leu Ser Lys Arg Ile Ser Glu Val Gly Asp Leu Ala His Asn Lys Tyr
195 200 205

Lys Ile Glu Leu Thr Val Ser Gly Lys Thr Ile Val Lys Pro Val Asp
210 215 220

Lys Gln Lys Pro Leu Asp Val Val Phe Val Leu Asp Asn Ser Asn Ser
225 230 235 240

Met Asn Asn Asp Gly Pro Asn Phe Gln Arg His Asn Lys Ala Lys Lys
245 250 255

Ala Ala Glu Ala Leu Gly Thr Ala Val Lys Asp Ile Leu Gly Ala Asn
260 265 270

Ser Asp Asn Arg Val Ala Leu Val Thr Tyr Gly Ser Asp Ile Phe Asp
275 280 285

Gly Arg Ser Val Asp Val Val Lys Gly Phe Lys Glu Asp Asp Lys Tyr
290 295 300

Tyr Gly Leu Gln Thr Lys Phe Thr Ile Gln Thr Glu Asn Tyr Ser His
305 310 315 320

Lys Gln Leu Thr Asn Asn Ala Glu Glu Ile Ile Lys Arg Ile Pro Thr
325 330 335

Glu Ala Pro Arg Ala Lys Trp Gly Ser Thr Thr Asn Gly Leu Thr Pro
340 345 350

Glu Gln Gln Lys Gln Tyr Tyr Leu Ser Lys Val Gly Glu Thr Phe Thr
355 360 365

Met Lys Ala Phe Met Glu Ala Asp Asp Ile Leu Ser Gln Val Asp Arg
370 375 380

Asn Ser Gln Lys Ile Ile Val His Ile Thr Asp Gly Val Pro Thr Arg
385 390 395 400

Ser Tyr Ala Ile Asn Asn Phe Lys Leu Gly Ala Ser Tyr Glu Ser Gln
405 410 415

Phe Glu Gln Met Lys Lys Asn Gly Tyr Leu Asn Lys Ser Asn Phe Leu
420 425 430

Leu Thr Asp Lys Pro Glu Asp Ile Lys Gly Asn Gly Glu Ser Tyr Phe
435 440 445

Leu Phe Pro Leu Asp Ser Tyr Gln Thr Gln Ile Ile Ser Gly Asn Leu
450 455 460

Gln Lys Leu His Tyr Leu Asp Leu Asn Leu Asn Tyr Pro Lys Gly Thr
465 470 475 480

Ile Tyr Arg Asn Gly Pro Val Arg Glu His Gly Thr Pro Thr Lys Leu
485 490 495

Tyr Ile Asn Ser Leu Lys Gln Lys Asn Tyr Asp Ile Phe Asn Phe Gly
500 505 510

Ile Asp Ile Ser Ala Phe Arg Gln Val Tyr Asn Glu Asp Tyr Lys Lys
515 520 525

Asn Gln Asp Gly Thr Phe Gln Lys Leu Lys Glu Glu Ala Phe Glu Leu
530 535 540

Ser Asp Gly Glu Ile Thr Glu Leu Met Lys Ser Phe Ser Ser Lys Pro
545 550 555 560

Glu Tyr Tyr Thr Pro Ile Val Thr Ser Ser Asp Ala Ser Asn Asn Glu
565 570 575

Ile Leu Ser Lys Ile Gln Gln Gln Phe Glu Lys Val Leu Thr Lys Glu
580 585 590

Asn Ser Ile Val Asn Gly Thr Ile Glu Asp Pro Met Gly Asp Lys Ile
595 600 605

Asn Leu Gln Leu Gly Asn Gly Gln Thr Leu Gln Pro Ser Asp Tyr Thr
610 615 620

Leu Gln Gly Asn Asp Gly Ser Ile Met Lys Asp Ser Ile Ala Thr Gly
625 630 635 640

Gly Pro Asn Asn Asp Gly Gly Ile Leu Lys Gly Val Lys Leu Glu Tyr
645 650 655

Ile Lys Asn Lys Leu Tyr Val Arg Gly Leu Asn Leu Gly Glu Gly Gln
660 665 670

Lys Val Thr Leu Thr Tyr Asp Val Lys Leu Asp Asp Ser Phe Ile Ser
675 680 685

Asn Lys Phe Tyr Asp Thr Asn Gly Arg Thr Thr Leu Asn Pro Lys Ser
690 695 700

Glu Asp Pro Asn Thr Leu Arg Asp Phe Pro Ile Pro Lys Ile Arg Asp
705 710 715 720

Val Arg Glu Tyr Pro Thr Ile Thr Ile Lys Asn Glu Lys Lys Leu Gly
725 730 735

Glu Ile Glu Phe Thr Lys Val Asp Lys Asp Asn Asn Lys Leu Leu Leu
740 745 750

Lys Gly Ala Thr Phe Glu Leu Gln Glu Phe Asn Glu Asp Tyr Lys Leu
755 760 765

Tyr Leu Pro Ile Lys Asn Asn Asn Ser Lys Val Val Thr Gly Glu Asn
770 775 780

Gly Lys Ile Ser Tyr Lys Asp Leu Lys Asp Gly Lys Tyr Gln Leu Ile
785 790 795 800

Glu Ala Val Ser Pro Lys Asp Tyr Gln Lys Ile Thr Asn Lys Pro Ile
805 810 815

Leu Thr Phe Glu Val Val Lys Gly Ser Ile Gln Asn Ile Ile Ala Val
820 825 830

Asn Lys Gln Ile Ser Glu Tyr His Glu Glu Gly Asp Lys His Leu Ile
835 840 845

Thr Asn Thr His Ile Pro Pro Lys Gly Ile Ile Pro Met Thr Gly Gly
850 855 860

Lys Gly Ile Leu Ser Phe Ile Leu Ile Gly Gly Ser Met Met Ser Ile
865 870 875 880

Ala Gly Gly Ile Tyr Ile Trp Lys Arg Tyr Lys Lys Ser Ser Asp Ile
885 890 895

Ser Arg Glu Lys Asp
900



<210> 18 

<211> 674 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 18 



Met Lys Lys Ile Asn Lys Cys Leu Thr Val Phe Ser Thr Leu Leu Leu
1 5 10 15

Ile Leu Thr Ser Leu Phe Ser Val Ala Pro Ala Phe Ala Asp Asp Val
20 25 30

Thr Thr Asp Thr Val Thr Leu His Lys Ile Val Met Pro Gln Ala Ala
35 40 45

Phe Asp Asn Phe Thr Glu Gly Thr Lys Gly Lys Asn Asp Ser Asp Tyr
50 55 60

Val Gly Lys Gln Ile Asn Asp Leu Lys Ser Tyr Phe Gly Ser Thr Asp
65 70 75 80

Ala Lys Glu Ile Lys Gly Ala Phe Phe Val Phe Lys Asn Glu Thr Gly
85 90 95

Thr Lys Phe Ile Thr Glu Asn Gly Lys Glu Val Asp Thr Leu Glu Ala
100 105 110

Lys Asp Ala Glu Gly Gly Ala Val Leu Ser Gly Leu Thr Lys Asp Thr
115 120 125

Gly Phe Ala Phe Asn Thr Ala Lys Leu Lys Gly Thr Tyr Gln Ile Val
130 135 140

Glu Leu Lys Glu Lys Ser Asn Tyr Asp Asn Asn Gly Ser Ile Leu Ala
145 150 155 160

Asp Ser Lys Ala Val Pro Val Lys Ile Thr Leu Pro Leu Val Asn Asn
165 170 175

Gln Gly Val Val Lys Asp Ala His Ile Tyr Pro Lys Asn Thr Glu Thr
180 185 190

Lys Pro Gln Val Asp Lys Asn Phe Ala Asp Lys Asp Leu Asp Tyr Thr
195 200 205

Asp Asn Arg Lys Asp Lys Gly Val Val Ser Ala Thr Val Gly Asp Lys
210 215 220

Lys Glu Tyr Ile Val Gly Thr Lys Ile Leu Lys Gly Ser Asp Tyr Lys
225 230 235 240

Lys Leu Val Trp Thr Asp Ser Met Thr Lys Gly Leu Thr Phe Asn Asn
245 250 255

Asn Val Lys Val Thr Leu Asp Gly Lys Asp Phe Pro Val Leu Asn Tyr
260 265 270

Lys Leu Val Thr Asp Asp Gln Gly Phe Arg Leu Ala Leu Asn Ala Thr
275 280 285

Gly Leu Ala Ala Val Ala Ala Ala Ala Lys Asp Lys Asp Val Glu Ile
290 295 300

Lys Ile Thr Tyr Ser Ala Thr Val Asn Gly Ser Thr Thr Val Glu Val
305 310 315 320

Pro Glu Thr Asn Asp Val Lys Leu Asp Tyr Gly Asn Asn Pro Thr Glu
325 330 335

Glu Ser Glu Pro Gln Glu Gly Thr Pro Ala Asn Gln Glu Ile Lys Val
340 345 350

Ile Lys Asp Trp Ala Val Asp Gly Thr Ile Thr Asp Val Asn Val Ala
355 360 365

Val Lys Ala Ile Phe Thr Leu Gln Glu Lys Gln Thr Asp Gly Thr Trp
370 375 380

Val Asn Val Ala Ser His Glu Ala Thr Lys Pro Ser Arg Phe Glu His
385 390 395 400

Thr Phe Thr Gly Leu Asp Asn Thr Lys Thr Tyr Arg Val Val Glu Arg
405 410 415

Val Ser Gly Tyr Thr Pro Glu Tyr Val Ser Phe Lys Asn Gly Val Val
420 425 430

Thr Ile Lys Asn Asn Lys Asn Ser Asn Asp Pro Thr Pro Ile Asn Pro
435 440 445

Ser Glu Pro Lys Val Val Thr Tyr Gly Arg Lys Phe Val Lys Thr Asn
450 455 460

Gln Ala Asn Thr Glu Arg Leu Ala Gly Ala Thr Phe Leu Val Lys Lys
465 470 475 480

Glu Gly Lys Tyr Leu Ala Arg Lys Ala Gly Ala Ala Thr Ala Glu Ala
485 490 495

Lys Ala Ala Val Lys Thr Ala Lys Leu Ala Leu Asp Glu Ala Val Lys
500 505 510

Ala Tyr Asn Asp Leu Thr Lys Glu Lys Gln Glu Gly Gln Glu Gly Lys
515 520 525

Thr Ala Leu Ala Thr Val Asp Gln Lys Gln Lys Ala Tyr Asn Asp Ala
530 535 540

Phe Val Lys Ala Asn Tyr Ser Tyr Glu Trp Val Ala Asp Lys Lys Ala
545 550 555 560

Asp Asn Val Val Lys Leu Ile Ser Asn Ala Gly Gly Gln Phe Glu Ile
565 570 575

Thr Gly Leu Asp Lys Gly Thr Tyr Ser Leu Glu Glu Thr Gln Ala Pro
580 585 590

Ala Gly Tyr Ala Thr Leu Ser Gly Asp Val Asn Phe Glu Val Thr Ala
595 600 605

Thr Ser Tyr Ser Lys Gly Ala Thr Thr Asp Ile Ala Tyr Asp Lys Gly
610 615 620

Ser Val Lys Lys Asp Ala Gln Gln Val Gln Asn Lys Lys Val Thr Ile
625 630 635 640

Pro Gln Thr Gly Gly Ile Gly Thr Ile Leu Phe Thr Ile Ile Gly Leu
645 650 655

Ser Ile Met Leu Gly Ala Val Val Val Met Lys Lys Arg Gln Ser Glu
660 665 670

Glu Ala



<210> 19 

<211> 635 

<212> PRT 

<213> Streptococcus agalactiae 



<400> 19 

Met Lys Lys Gln Phe Leu Lys Ser Ala Ala Ile Leu Ser Leu Ala Val
1 5 10 15

Thr Ala Val Ser Thr Ser Gln Pro Val Ala Gly Ile Thr Lys Asp Tyr
20 25 30

Asn Asn Arg Asn Glu Lys Val Lys Lys Tyr Leu Gln Glu Asn Asn Phe
35 40 45

Gly His Lys Ile Ala Tyr Gly Trp Lys Asn Lys Val Glu Phe Asp Phe
50 55 60

Arg Tyr Leu Leu Asp Thr Ala Lys Tyr Leu Val Asn Lys Glu Glu Phe
65 70 75 80

Gln Asp Pro Leu Tyr Asn Asp Ala Arg Glu Glu Leu Ile Ser Phe Ile
85 90 95

Phe Pro Tyr Glu Lys Phe Leu Ile Asn Asn Arg Asp Ile Thr Lys Leu
100 105 110

Thr Val Asn Gln Tyr Glu Ala Ile Val Asn Arg Met Ser Val Ala Leu
115 120 125

Gln Lys Phe Ser Lys Asn Ile Phe Glu Lys Gln Lys Val Asn Lys Asp
130 135 140

Leu Ile Pro Ile Ala Phe Trp Ile Glu Lys Ser Tyr Arg Thr Val Gly
145 150 155 160

Thr Asn Glu Ile Ala Ala Ser Val Gly Ile Gln Gly Gly Phe Tyr Gln
165 170 175

Asn Phe His Asp Tyr Tyr Asn Tyr Ser Tyr Leu Leu Asn Ser Leu Trp
180 185 190

His Glu Gly Asn Val Lys Glu Val Val Lys Asp Tyr Glu Asn Thr Ile
195 200 205

Arg Gln Ile Leu Ser Lys Lys His Glu Ile Glu Lys Ile Leu Asn Gln
210 215 220

Ser Thr Ser Asp Ile Ser Ile Asp Asp Asp Asp Tyr Glu Lys Gly Asn
225 230 235 240

Lys Glu Leu Leu Arg Glu Lys Leu Asn Ile Ile Leu Asn Leu Ser Lys
245 250 255

Arg Asp Tyr Arg Val Thr Pro Tyr Tyr Glu Val Asn Lys Leu His Thr
260 265 270

Gly Leu Ile Leu Leu Glu Asp Val Pro Asn Leu Lys Ile Ala Lys Asp
275 280 285

Lys Leu Phe Ser Leu Glu Asn Ser Leu Lys Glu Tyr Lys Gly Glu Lys
290 295 300

Val Asn Tyr Glu Glu Leu Arg Phe Asn Thr Glu Pro Leu Thr Ser Tyr
305 310 315 320

Leu Glu Asn Lys Glu Lys Phe Leu Val Pro Asn Ile Pro Tyr Lys Asn
325 330 335

Lys Leu Ile Leu Arg Glu Glu Asp Lys Tyr Ser Phe Glu Asp Asp Glu
340 345 350

Glu Glu Phe Gly Asn Glu Leu Leu Ser Tyr Asn Lys Leu Lys Asn Glu
355 360 365

Val Leu Pro Val Asn Ile Thr Thr Ser Thr Ile Leu Lys Pro Phe Glu
370 375 380

Gln Lys Lys Ile Val Glu Asp Phe Asn Pro Tyr Ser Asn Leu Asp Asn
385 390 395 400

Leu Glu Ile Lys Lys Ile Arg Leu Asn Gly Ser Gln Lys Gln Lys Val
405 410 415

Glu Gln Glu Lys Thr Lys Ser Pro Thr Pro Gln Lys Glu Thr Val Lys
420 425 430

Glu Gln Thr Glu Gln Lys Val Ser Gly Asn Thr Gln Glu Val Glu Lys
435 440 445



WO 2004/035618 PCT/EP2003/011436

Lys Ser Glu Thr Val Ala Thr Ser Gln Gln Ser Ser Val Ala Gln Thr
450 455 460

Ser Val Gln Gln Pro Ala Pro Val Gln Ser Val Val Gln Glu Ser Lys
465 470 475 480

Ala Ser Gln Glu Glu Ile Asn Ala Ala His Asp Ala Ile Ser Ala Tyr
485 490 495

Lys Ser Thr Val Asn Ile Ala Asn Thr Ala Gly Val Thr Thr Ala Glu
500 505 510

Met Thr Thr Leu Ile Asn Thr Gln Thr Ser Asn Leu Ser Asp Val Glu
515 520 525

Lys Ala Leu Gly Asn Asn Lys Val Asn Asn Gly Ala Val Asn Val Leu
530 535 540

Arg Glu Asp Thr Ala Arg Leu Glu Asn Met Ile Trp Asn Arg Ala Tyr
545 550 555 560

Gln Ala Ile Glu Glu Phe Asn Val Ala Arg Asn Thr Tyr Asn Asn Gln
565 570 575

Ile Lys Thr Glu Thr Val Pro Val Asp Asn Asp Ile Glu Ala Ile Leu
580 585 590

Ala Gly Ser Gln Ala Lys Ile Ser His Leu Asp Asn Arg Ile Gly Ala
595 600 605

Arg His Met Asp Gln Ala Phe Val Ala Ser Leu Leu Glu Val Thr Glu
610 615 620

Met Ser Lys Ser Ile Ser Ser Arg Ile Lys Glu
625 630 635



<210> 20 
<211> 181 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 20 

Met Lys Lys Ile Thr Thr Leu Ile Leu Ala Ser Ser Leu Leu Leu Val
1 5 10 15

Ala Thr Thr Ser Val Lys Ala Asp Asp Asn Phe Glu Met Pro Thr Arg
20 25 30

Tyr Val Lys Met Ser Glu Lys Ser Lys Ala Phe Tyr Gln Arg Leu Gln
35 40 45

Glu Lys Gln Arg Lys Ala His Thr Thr Val Lys Thr Phe Asn Asn Ser
50 55 60

Glu Ile Arg His Gln Leu Pro Leu Lys Gln Glu Lys Ala Arg Asn Asp
65 70 75 80

Ile Tyr Asn Leu Gly Ile Leu Ile Ser Gln Glu Ser Lys Gly Phe Ile
85 90 95

Gln Arg Ile Asp Asn Ala Tyr Ser Leu Glu Asn Val Ser Asp Ile Val
100 105 110

Asn Glu Ala Gln Ala Leu Tyr Lys Arg Asn Tyr Asp Leu Phe Glu Lys
115 120 125

Ile Lys Ser Thr Arg Asp Lys Val Gln Val Leu Leu Ala Ser His Gln
130 135 140

Asp Asn Thr Asp Leu Lys Asn Phe Tyr Ala Glu Leu Asp Asp Met Tyr
145 150 155 160

Glu His Val Tyr Leu Asn Glu Ser Arg Val Glu Ala Ile Asn Arg Asn
165 170 175

Ile Gln Lys Tyr Asn
180



<210> 21 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 21 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 

<210> 22 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 22 

ggcaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaa 48

<210> 23
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 23 

ggcaatgttt tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48 

<210> 24 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 24 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 

<210> 25 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 25 

ggcaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 26 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 26 

ggcaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48 

<210> 27 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 

<400> 27 

ggtaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 

<210> 28
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 28 

ggtaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 29 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 29 

ggtaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaa 48

<210> 30
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 30 

ggcaatgttt tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48 



<210> 31 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae



<400> 31 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 

<210> 32 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 32 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 



<210> 33 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 33 

ggcaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 



<210> 34 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 34 

ggcaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 



<210> 35 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 35 

ggcaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48 



<210> 36 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae







48 



<210> 37 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 37 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 



<210> 38
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 38

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 39
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 39

ggcaatgttt tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48

<210> 40
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 40

ggcaatgttt tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48

<210> 41
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 41

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 42
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 42

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48



<210> 43 
<211> 48 
<212> DNA 
<213> Streptococcus agalactiae



<400> 43 

ggtaatgttc cagagcgtcg ccaacgcgat gttgaaaata aaagccaa 48



<210> 44 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 44 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48 



<210> 45
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 45

ggtaatgttc tagagcgtcg tcaacgcgat gttgaaaata aaagccaa 48

<210> 46
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 46

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 47
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 47

ggtaatgttc tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48

<210> 48
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 48

ggcaatgttt tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48

<210> 49
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 49

ggcaatgttc tagagcgtcg tcaacgtgat gctgaaaaca aaagccaa 48



<210> 50 
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 50

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48



<210> 51 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae

<400> 51

ggcaatgttt tagagcgtcg tcaacgtgat gctgaaaaca gaagccaa 48



<210> 52 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae

<400> 52

ggcaatgttt tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48



<210> 53 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 53 

ggtaatgttc tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48



<210> 54 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 54 

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48



<210> 55 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae

<400> 55

ggcaatgttt tagagcgtcg tcaacgcgat gttgagaata agagccaa 48



<210> 56 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae

<400> 56

ggcaatgttt tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48

<210> 57
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 57 

ggtaatgttc tagagcgtcg tcaacgtgat gcggataaca agagccaa 48 

<210> 58 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 58 

ggcaatgttc tagaacgtcg tcaacgcgat gtagaaaaca gaagccaa 48 

<210> 59 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 59 

ggcaatgttc tagagcgtcg tcaacgcgat gcggataaca agagccaa 48 

<210> 60
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 60 

ggcaatgttt tagagcgccg ccaacgcgat gcagaaaaca aaagtcag 48 

<210> 61 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 61 

ggcaatgttc tagaacgtcg tcaacgtgat gttgagaata agagccaa 48 

<210> 62 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 62 

ggcaatgttc tagagcgtcg ccaacgtgat gcagaaaaca aaagtcag 48 

<210> 63 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 63 
ggtaatgttc tagagcgtcg tcaacgcgat gcagataaca agagccaa 48

<210> 64 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 64 

ggcaatgttc tagaacgtcg tcaacgtgat gttgagaata agagccaa 48 

<210> 65 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 65 

ggcaatgttc tagaacgtcg tcaacgtgat gttgagaata agagccaa 48 

<210> 66 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 66 

ggcaatgttc tagagcgtcg ccaacgtgat gcagaaaaca aaagtcag 48 

<210> 67 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 67

ggtaatgttc tagagcgtcg tcaacgcgat gcagataaca agagccaa 48 

<210> 68 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 68 

ggtaatgttc tagaacgtcg tcaacgcgat gtggaaaaca aaagtcag 48 

<210> 69 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 69 

ggcaatgttc tagagcgtcg ccaacgtgat gttgagaaca agagccaa 48 

<210> 70 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 
<400> 70

ggcaatgttt tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48



<210> 71 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 71 

ggtaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48



<210> 72
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 72

ggtaatgttc tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48

<210> 73
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 73

ggcaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48

<210> 74
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 74

ggtaatgttt tagaacgtcg tcaacgcgat gttgagaaca agagccaa 48

<210> 75
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 75

ggtaatgttt tagagcgtcg ccaacgtgat gcggaaaaca aaagtcag 48

<210> 76
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 76

ggcaatgttt tagagcgtcg tcaacgtgat gcagaaaaca gaagccaa 48



<210> 77 
<211> 48 
<212> DNA

<213> Streptococcus agalactiae 
<400> 77 

ggtaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaa 48 

<210> 78 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 78 

ggcaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaa 48

<210> 79 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 79 

ggtaatgttc tagagcgtcg tcaacgcgat gttgagaata agagccaa 48

<210> 80 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 80 

ggtaatgttc tagagcgtcg tcaacgtgat gcggaaaaca agagccaa 48 

<210> 81 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 81 

ggcaatgttc tagagcgtcg tcaacgcgat gcagaaaaca gaagccaa 48 

<210> 82 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 82 

ggtaatgttt tagagcgtcg ccaacatgat gttgagaata agagtcaa 48 

<210> 83 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 83 

ggtaatgttc tagagcgtcg ccaacgtgat gcggataaca agagccaa 48

<210> 84
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 84 

ggtaatgttt tagagcgtcg ccaacgtgat gcagataaca aaagtcag 48 



<210> 85
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 85

ggcaatgttc tagaacgtcg ccaacgtgat gttgataaca agagccaa 48

<210> 86
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 86

ggtaacgttc tagagcgtcg ccaacgcgat gctgataaca agagccaa 48

<210> 87
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 87

ggtaatgttt tagagcgccg ccaacgcgat gcagataaca aaagtcaa 48

<210> 88
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 88

ggtaatgttc tagagcgtcg ccaacgcgat gttgataaca agagccag 48

<210> 89
<211> 48
<212> DNA

<213> Streptococcus agalactiae
<400> 89

ggtaatgttt tagagcgtcg ccaacgcgat gcagataaca aaagtcag 48



<210> 90 
<211> 48 
<212> DNA 



<213> Streptococcus agalactiae 



<400> 90 

ggtaatgttt tagagcgtcg ccaacgcgat gttgataaca aaagccaa 48

<210> 91

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 91 

ggtaatgttt tagagcgtcg ccaacgtgat gctgataaca aaagtcag 48 

<210> 92 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 92 

ggcaatgttc tagagcgtcg ccaacgtgat gcggataaca aaagccaa 48 

<210> 93 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 93 

ggtaatgttc tagagcgtcg ccaacgcgat gcggataaca aaagtcag 48

<210> 94 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 94 

ggcaatgttt tagagcgtcg ccaacgtgat gctgataaca aaagtcaa 48 

<210> 95 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 95 

ggtaatgttc tagagcgtcg ccaacgcgat gcagataaca aaagccaa 48 

<210> 96 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 96 

ggtaatgttc tagagcgtcg ccaacgcgat gctgataaca aaagtcaa 48 

<210> 97 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 97 
ggtaatgttc tagagcgtcg ccaacgtgat gctgataaca agagccaa 48

<210> 98 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 98 

ggcaatgttc ttgagcgtcg tcaacgcgat gtcgataaca aaagtcag 48

<210> 99 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 99 

ggtaatgttt tagagcgtcg ccaacgtgat gcggataaca agagtcaa 48

<210> 100 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 100 

ggtaatgttt tagagcgtcg ccaacgcgat gcggataaca agagccaa 48 

<210> 101 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 101 

ggtaatgttt tagagcgtcg ccaacgcgat gcggataaca agagtcaa 48 

<210> 102 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 102 

ggtaatgttt tagagcgtcg ccaacgcgat gcggataaca agagccaa 48

<210> 103 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 

<400> 103 

ggtaatgttt tagagcgtcg ccaacgcgat gcagataaca aaagtcaa 48

<210> 104 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 
<400> 104

ggtaatgttt tagagcgtcg ccaacgcgat gctgataaca agagccaa 48

<210> 105 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 105 

ggtaatgttt tagagcgtcg tcaacgtgat gcagataaca aaagtcag 48 

<210> 106 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 106 

ggcaatgttt tagagcgtcg tcaacgtgat gcggataaca agagccaa 48 

<210> 107 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 107 

ggtaatgttt tagagcgtcg ccaacgtgat gcggataaca agagccag 48 

<210> 108 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 108 

ggcaatgttc tagaacgtcg tcaacgtgat gcggataaca agagccaa 48 

<210> 109 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 109 

ggtaacgttt tagagcgtcg ccaacgtgat gcggataaca agagccag 48 

<210> 110 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 110 

ggcaatgttt tagagcgccg ccaacgcgat gcagataaca aaagtcaa 48 

<210> 111 
<211> 48 
<212> DNA

<213> Streptococcus agalactiae 
<400> 111 

ggtaatgttc tagagcgtcg ccaacgcgat gcagataaca agagccag 48 

<210> 112 
<211> 48 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 112 

ggtaatgttc tagagcgtcg ccaacgcgat gcggaaaaca aaagtcaa 48 

<210> 113 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 113 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 114 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 114 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 115 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 115 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 116 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 116 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 117 
<211> 16
<212> PRT 

<213> Streptococcus agalactiae 
<400> 117 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 118 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 118 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 119 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 119 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 120 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 120 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 121 
<211> 16
<212> PRT

<213> Streptococcus agalactiae 
<400> 121 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 122 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae



<400> 122 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 123 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 123 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 124 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 124 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 125 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 125 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 126 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 126 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 127 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 127 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 128 
<211> 16 
<212> PRT 
<213> Streptococcus agalactiae
<400> 128 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 129 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 129 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 130 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 130 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 131 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 131 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 132 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 132 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 133 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae

<400> 133

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15

<210> 134
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 134 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 135 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 135 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 136 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 136 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 137 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 137 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 138 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 138 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 139 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 
<400> 139

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15

<210> 140 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 140 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 141 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 141 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 142 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 142 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 143 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 143 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 144 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 144 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15

<210> 145
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 145 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 146 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 146 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 147 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 147 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 148 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 148 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 149 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 149 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 150 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae
<400> 150

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Arg Ser Gln
1 5 10 15



<210> 151 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 151 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 152 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 152 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15 



<210> 153 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 153 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 154 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 154 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 155 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 155 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15

<210> 156
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 156 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 157 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 157 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 158 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 158 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 159 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 159 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 160 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 160 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 161 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 161 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 162 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 162 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 163 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 163 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 164 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 164

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 165 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 165 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 166 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 166 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 167 
<211> 16 
<212> PRT

<213> Streptococcus agalactiae
<400> 167

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 168 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 168 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 169 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 169 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 170 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 170 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 171 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 171 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 172 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 



<400> 172 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 173 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 173 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 174 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 174 

Gly Asn Val Leu Glu Arg Arg Gln His Asp Val Glu Asn Lys Ser Gln
1 5 10 15



<210> 175 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 175 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 176 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 176 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 177 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 177 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln
1 5 10 15



<210> 178
<211> 16
<212> PRT

<213> Streptococcus agalactiae
<400> 178

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 179 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 179 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 180 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 180 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln
1 5 10 15



<210> 181 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 181 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 182 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 182 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln
1 5 10 15



<210> 183 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae

<400> 183

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15

<210> 184
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 184 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 185 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 185 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 186 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 186 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 187 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 187 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 188 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 188 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 189 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae

<400> 189



wo 2004/035618 PCT/EP2003/011436 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 190 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae



<400> 190 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Val Asp Asn Lys Ser Gln
1 5 10 15



<210> 191 

<211> 16 

<212> PRT

<213> Streptococcus agalactiae



<400> 191 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 192 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae



<400> 192 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 193 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae



<400> 193 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 194 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 194 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 195 






<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 195 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 196 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 196 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 197 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 197 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 198 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 198 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 199 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 199 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 200 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 



<400> 200

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 201 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 201 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 202 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 202 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 203 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 203 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Asp Asn Lys Ser Gln
1 5 10 15



<210> 204 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 204 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Lys Ser Gln
1 5 10 15



<210> 205 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 205 

Gly Leu Ser Gln Asn Arg Asp Val Arg Glu Asn Gln Arg Ala Arg Glu
1 5 10 15



<210> 206 
<211> 16 
<212> PRT

<213> Streptococcus agalactiae 
<400> 206 

Ala Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 207 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 207 

Gly Ala Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 208 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 208 

Gly Asn Ala Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 209 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 209 

Gly Asn Val Ala Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 210 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 210 



Gly Asn Val Leu Ala Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 211 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae

<400> 211

Gly Asn Val Leu Glu Ala Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 212 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 212 

Gly Asn Val Leu Glu Arg Ala Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 213 
<211> 16
<212> PRT 

<213> Streptococcus agalactiae 
<400> 213 

Gly Asn Val Leu Glu Arg Arg Ala Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 214 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 214 

Gly Asn Val Leu Glu Arg Arg Gln Ala Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 215 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 215 

Gly Asn Val Leu Glu Arg Arg Gln Arg Ala Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 216 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 

<400> 216 



Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Gln
1 5 10 15



<210> 217 
<211> 16 
<212> PRT 
<213> Streptococcus agalactiae
<400> 217 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Ala Asn Arg Ser Gln
1 5 10 15



<210> 218 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 218 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Ala Arg Ser Gln
1 5 10 15



<210> 219 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 219 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Ala Ser Gln
1 5 10 15



<210> 220 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 220 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ala Gln
1 5 10 15



<210> 221 
<211> 16 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 221 

Gly Asn Val Leu Glu Arg Arg Gln Arg Asp Ala Glu Asn Arg Ser Ala
1 5 10 15



<210> 222 

<211> 16 

<212> PRT 

<213> Streptococcus agalactiae 
<220>
<221> misc_feature
<222> (2)..(2)
<223> N, S or T

<220>
<221> misc_feature
<222> (5)..(5)
<223> X can be A, E, M or Q

<220>
<221> misc_feature
<222> (8)..(8)
<223> X can be any amino acid

<220>
<221> misc_feature
<222> (9)..(9)
<223> X can be K, R or W

<220>
<221> misc_feature
<222> (10)..(10)
<223> X can be A, D, E, H or Q

<220>
<221> misc_feature
<222> (11)..(11)
<223> X can be A, P, I, L, V or Y

<220>
<221> misc_feature
<222> (12)..(12)
<223> X can be any amino acid

<220>
<221> misc_feature
<222> (13)..(13)
<223> X can be any amino acid

<220>
<221> misc_feature
<222> (14)..(14)
<223> X can be K or R

<220>
<221> misc_feature
<222> (15)..(15)
<223> X can be any amino acid

<220>
<221> misc_feature
<222> (16)..(16)
<223> X can be any amino acid

<400> 222

Gly Xaa Val Leu Xaa Arg Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15



<210> 223 
<211> 28 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 223 

gtcctgtatc tgccatggat agtgttgg 28 

<210> 224
<211> 29 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 224 

ccgcggatcc acattttgat catcacctg 29 

<210> 225 
<211> 28 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 225 

gtcctgtatc tgccatggat agtgttgg 28 

<210> 226 
<211> 27 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 226 

ccgcggatcc cctataagtt gacctac 27 

<210> 227 
<211> 30 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 227 

Thr Gly Cys Thr Thr Thr Gly Cys Cys Ala Thr Gly Gly Thr Ala Gly 
1 5 10 15

Gly Thr Cys Ala Ala Cys Thr Thr Ala Thr Ala Gly Gly Gly 
20 25 30 



<210> 228 
<211> 29 
<212> DNA 
<213> Streptococcus agalactiae
<400> 228

ccgcggatcc acattttgat catcacctg 29



<210> 229 
<211> 29 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 229 

gtgccttgcc atggaaagta ccgtaccgg 29 



<210> 230 
<211> 32 

<212> DNA 

<213> Streptococcus agalactiae 
<400> 230 

gcggacagct cgagtttccc acctgtcatc gg 32 



<210> 231 
<211> 33 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 231 

gtgccttgcc atggacgacg taacaactga tac 33 



<210> 232 
<211> 31 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 232 

gcggacagct cgagtgtacc aataccacct g 31 



<210> 233 
<211> 30 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 233 

gtgccttgcc atgggccggg ataactaaag 30 



<210> 234 
<211> 33 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 234 

gcggacagct cgagctcttt tatacgccat gag 33 



<210> 235 
<211> 30
<212> DNA

<213> Streptococcus agalactiae
<400> 235

ccgcggatcc gatgataact ttgaaatgcc 30



<210> 236 
<211> 30 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 236 

tggcacaagc ttacattctg agcagaaagc 30 



<210> 237 
<211> 15 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 237 

aatatcgccc tgagc 15 



<210> 238 
<211> 16 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 238 

ggttttccca gtcacg 16 



<210> 239 
<211> 28 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 239 

gtcctgtatc tgctatggat agtgttgg 28 



<210> 240 
<211> 19 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 240 

acattttgat catcacctg 19 



<210> 241 

<211> 19 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 241 

actgctgagc taacaggtg 19 

<210> 242 
<211> 20 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 242 

acatcacctg acaatgtcgc 20 

<210> 243 
<211> 20 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 243 

gcgattgtga atagaatgag 20 

<210> 244 
<211> 19 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 244 

tatacaaagc ctgagcttc 19 

<210> 245 
<211> 20 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 245 

ttaccgtagc ctgtatcacc 20 

<210> 246 
<211> 18 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 246 

cgacctacga tagcaacg 18 

<210> 247 
<211> 27 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 247 

ccgcggatcc gaatatgcta ccatcac 27 

<210> 248 

<211> 39 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 248 

cccatccact aaacttaaac attcctgatt tccaagttc 39 






<210> 249 
<211> 38 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 249 

tgtttaagtt tagtggatgg ggctgcggtt tgagacgc 



<210> 250 

<211> 30 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 250 

tggcacaagc tttacctgct gagcgacttg 



<210> 251 

<211> 19 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 251 

gttaaaggta acctgcctg 



<210> 252 

<211> 48 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 252 

cccatccact aaacttaaac atacaactcc tattgtgccg aaatgtcg 



<210> 253 

<211> 42 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 253 

tgtttaagtt tagtggatgg gcacttagag attttccaat cc 



<210> 254 

<211> 17 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 254 
gacatcatag atccacc 



<210> 255 

<211> 29 

<212> DNA 

<213> Streptococcus agalactiae 
<400> 255 

ccgcggatcc ggagctacgt ttgaacttc 



<210> 256 
<211> 39 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 256 

cccatccact aaacttaaac aatattaccg cagcaccac 



<210> 257 
<211> 39 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 257 

tgtttaagtt tagtggatgg gacaagaagg ccaagaagg 



<210> 258 

<211> 34 

<212> DNA 

<213> Streptococcus agalactiae 



<400> 258 

cacgcaacgc gtcgacgcac agctttaact gtac 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 
International Bureau 

(43) International Publication Date 
29 April 2004 (29.04.2004) 







(10) International Publication Number 

WO 2004/035618 A3 



(51) International Patent Classification⁷: C07K 14/315 



(21) International Application Number: 



PCT/EP2003/011436 



(22) International Filing Date: 15 October 2003 (15.10.2003) 



(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
02023141.1 15 October 2002 (15.10.2002) EP 
03006393.7 20 March 2003 (20.03.2003) EP 



(71) Applicant (for all designated States except US): INTERCELL AG 
[AT/AT]; Campus Vienna Biocenter 6, A-1030 
Wien (AT). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): REINSCHEID, Dieter, J. 
[DE/DE]; Albrecht-Dürer-Str. 23, 89231 Neu-Ulm 
(DE). GUTEKUNST, Heike [DE/DE]; Sachsenring 83, 
88400 Biberach (DE). SCHUBERT, Axel [DE/DE]; 
Johann-Strauss-Str. 25, 89231 Neu-Ulm (DE). EIKMANNS, 
Bernhardt J. [DE/DE]; Gleisselstetten 49, 89069 Ulm 
(DE). MEINKE, Andreas [DE/AT]; Piettegasse 26/1, 
A-3013 Pressbaum (AT). 

(74) Agent: BOHMANN, Armin, K.; Bohmann & Loosen, 
Sonnenstrasse 8, 80331 München (DE). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, GB, GD, GE, 
GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, 
KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, 
MN, MW, MX, MZ, NI, NO, NZ, OM, PG, PH, PL, PT, 
RO, RU, SC, SD, SE, SG, SK, SL, SY, TJ, TM, TN, TR, 
TT, TZ, UA, UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

— before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

(88) Date of publication of the international search report: 

30 September 2004 

For two-letter codes and other abbreviations, refer to the "Guidance 
Notes on Codes and Abbreviations" appearing at the beginning 
of each regular issue of the PCT Gazette. 






(54) Title: NUCLEIC ACIDS CODING FOR ADHESION FACTOR OF GROUP B STREPTOCOCCUS, ADHESION FACTORS 
OF GROUP B STREPTOCOCCUS AND FURTHER USES THEREOF 



(57) Abstract: The present invention is related to nucleic acids coding for adhesion factors of group B streptococcus, adhesion 
factors of group B streptococcus and uses thereof. More particularly, the present invention is related to a polypeptide being such 
adhesion factors and comprising an amino acid sequence, whereby the amino acid sequence is selected from the group comprising 
SEQ ID NO 11 to SEQ ID NO 20, and the use of such polypeptide for the manufacture of a vaccine. 



INTERNATIONAL SEARCH REPORT 

International Application No 

PCT/EP 03/11436 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C07K14/315 






According to International Patent Classification (IPC) or to both national classification and IPC 




B. FIELDS SEARCHED 


Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C07K 


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 


Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal, WPI Data, PAJ, BIOSIS, EMBL 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category * 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 00/06736 A (HANNIFFY SEAN BOSCO ;LE 
PAGE RICHARD WILLIAM FALLA (GB); WELLS 
JER) 10 February 2000 (2000-02-10) 
abstract; figure 1 


1,3-27, 
29-37 


X 


GLASER ET AL.: "Genome sequence of 
Streptococcus agalactiae, a pathogen 
causing invasive neonatal disease" 
MOLECULAR MICROBIOLOGY, 
vol. 45, no. 6, 
27 September 2002 (2002-09-27), pages 
1499-1513 XP002268222 
the whole document 


1-37 






-/-- 




[X] Further documents are listed in the continuation of box C. 


[X] Patent family members are listed in annex. 


* Special categories of cited documents: 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 

"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such documents, 
such combination being obvious to a person skilled 
in the art 

"&" document member of the same patent family 


Date of the actual completion of the international search 

1 June 2004 


Date of mailing of the international search report 

04.08.04 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Kalsner, I 



Form PCT/ISA/210 (second sheet) (January 2004) 






C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



P,X 



P,X 



TETTELIN HERVE ET AL: "Complete genome 
sequence and comparative genomic analysis 
of an emerging human pathogen, serotype V 
Streptococcus agalactiae" 
PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF THE UNITED STATES, 
vol. 99, no. 19, 
17 September 2002 (2002-09-17), pages 
12391-12396, XP002268223 
September 17, 2002 
ISSN: 0027-8424 
the whole document 

MEEHAN M ET AL: "AFFINITY PURIFICATION 
AND CHARACTERIZATION OF A 
FIBRINOGEN-BINDING PROTEIN COMPLEX WHICH 
PROTECTS MICE AGAINST LETHAL CHALLENGE 
WITH STREPTOCOCCUS EQUI SUBSP. EQUI" 
MICROBIOLOGY, SOCIETY FOR GENERAL 
MICROBIOLOGY, READING, GB, 
vol. 144, no. 4, 1998, pages 
993-1003-1130, XP000906842 
ISSN: 1350-0872 
the whole document 

SCHUBERT AXEL ET AL: "A fibrinogen 
receptor from group B Streptococcus 
interacts with fibrinogen by repetitive 
units with novel ligand binding sites" 
MOLECULAR MICROBIOLOGY, 
vol. 46, no. 2, 

24 October 2002 (2002-10-24), pages 
557-569, XP002268224 
ISSN: 0950-382X 
the whole document 

DATABASE EMBL 'Online! 

GLASER ET AL.: "Streptococcus agalactiae 
genome sequence, use for developing 
vaccines, diagnostic tools and for 
identifying therapeutic targets" 
retrieved from EMBL 
Database accession no. AX602133 
XP002268225 
& WO 02/092818 A (INSTITUT PASTEUR, CENTRE 
NATIONAL DE LA RECHERCHE SCIENTIFIQUE 
(CNRS) 21 November 2002 (2002-11-21) 
claims 
page 1-46 

-/-- 



1-8 



1,3-27, 
29-37 



1,3-27, 
29-37 



1-37 



Form PCT/ISA/210 (continuation of second sheet) (January 2004) 






C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 

Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



DATABASE EMBL 'Online! 

1 July 2002 (2002-07-01) 
TELFORD ET AL.: "Nucleic acids and 
proteins from Streptococcus groups A & B" 
Database accession no. CQ655069 
XP002281290 

abstract 

& WO 02/34771 A (CHIRON S.P.A.) 

2 May 2002 (2002-05-02) 
SEQ ID NO: 12026 
abstract 

OSAKI M ET AL: "CHARACTERIZATION OF 
STREPTOCOCCUS SUIS GENES ENCODING PROTEINS 
HOMOLOGOUS TO SORTASE OF GRAM-POSITIVE 
BACTERIA" 

JOURNAL OF BACTERIOLOGY, WASHINGTON, DC, 
US, 

vol. 184, no. 4, February 2002 (2002-02), 
pages 971-982, XP001156906 
the whole document 

JACOBSSON KARIN: "A novel family of 
fibrinogen-binding proteins in 
Streptococcus agalactiae." 
VETERINARY MICROBIOLOGY, 
vol. 96, no. 1, 

8 October 2003 (2003-10-08), pages 
103-113, XP002281289 
ISSN: 0378-1135 
the whole document 



2-6, 

8-13, 
16-37 



2-6, 

8-13, 

16-37 



2-6, 

8-13, 

16-37 



Form PCT/ISA/210 (continuation of second sheet) (January 2004) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/EP 03/11436 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. Claims Nos.: 
because they relate to subject matter not required to be searched by this Authority, namely: 



2. Claims Nos.: 
because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Claims Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1. [X] As all required additional search fees were timely paid by the applicant, this International Search Report covers 
all searchable claims. 



2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest 



[ ] The additional search fees were accompanied by the applicant's protest. 
[X] No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998) 



International Application No. PCT/EP 03/11436 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1, 7, 14, 15 (completely); 3-6, 8-13, 16-27, 
29-37 (partially) 

An isolated nucleic acid molecule, encoding a fibrinogen 
binding protein, comprising a nucleic acid having at least 
70% identity to a nucleic acid sequence of SEQ ID NO. 1, 2, 
3, 4, 5 or 6; an isolated nucleic acid molecule encoding a 
polypeptide comprising the amino acid motif of SEQ ID NO. 
222, a polypeptide comprising the amino acid sequence 
selected from SEQ ID NO. 113-205 or SEQ ID NO. 222; 
process for producing such polypeptides, pharmaceutical 
compositions comprising the polypeptides, antibody, use of 
the polypeptide or antibody; methods for identifying an 
antagonist, process for in vitro diagnosing, affinity device 
comprising such polypeptide; uses of the polypeptides 

2. Claims: 2, 3-6, 8-13, 16-27, 29-37 (all partially) 

An isolated nucleic acid molecule encoding an adhesion 
factor comprising a nucleic acid having at least 70% 
identity to a nucleic acid sequence of SEQ ID NO: 7-10; a 
polypeptide encoded by such nucleic acid molecule; process 
for producing such polypeptides, pharmaceutical compositions 
comprising the polypeptides, antibody, use of the 
polypeptide or antibody; methods for identifying an 
antagonist, process for in vitro diagnosing, affinity device 
comprising such polypeptide; uses of the polypeptides 

Claim 28 refers to an antagonist identified by a method according to 
claims 26 or 27 without giving a true technical characterization. 
Moreover, no such compounds are defined in the application. In 
consequence, the scope of said claims is ambiguous and vague, and their 
subject-matter is not sufficiently disclosed and supported (Art. 5 and 6 
PCT). No search can be carried out for such purely speculative claims 
whose wording is, in fact, a mere recitation of the results to be 
achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 